WO2023016167A1 - 一种虚拟形象视频通话方法、终端设备及存储介质 - Google Patents

一种虚拟形象视频通话方法、终端设备及存储介质 Download PDF

Info

Publication number
WO2023016167A1
WO2023016167A1 PCT/CN2022/104964 CN2022104964W WO2023016167A1 WO 2023016167 A1 WO2023016167 A1 WO 2023016167A1 CN 2022104964 W CN2022104964 W CN 2022104964W WO 2023016167 A1 WO2023016167 A1 WO 2023016167A1
Authority
WO
WIPO (PCT)
Prior art keywords
video call
avatar
user
network quality
virtual image
Prior art date
Application number
PCT/CN2022/104964
Other languages
English (en)
French (fr)
Inventor
王娅丽
周万富
Original Assignee
惠州Tcl云创科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 惠州Tcl云创科技有限公司 filed Critical 惠州Tcl云创科技有限公司
Priority to US18/579,842 priority Critical patent/US20240179272A1/en
Publication of WO2023016167A1 publication Critical patent/WO2023016167A1/zh

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/802D [Two Dimensional] animation, e.g. using sprites
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/157Conference systems defining a virtual conference space and using avatars or agents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/172Classification, e.g. identification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Definitions

  • the present invention relates to the field of mobile terminals, in particular to a virtual image-based video call method, terminal equipment and storage media.
  • the technical problem to be solved by the present invention is to provide an avatar video call method, device, terminal equipment and storage medium for the above-mentioned defects of the prior art.
  • a virtual image video call method wherein the method includes:
  • the current video call interface is switched to the avatar mapped with user behavior for display.
  • the virtual image video call method wherein, before the step of detecting the network quality index in the video call process, it includes:
  • the predetermined condition of the network quality index is set in advance, and when it is detected that the network state during the video call meets the predetermined condition, then the video call interface is controlled to be switched to the virtual image for display;
  • the avatar used for the video call is preset to be used when the network quality index meets the predetermined condition.
  • the avatar video call method wherein the step of judging whether the network quality index of the current video call satisfies a predetermined condition includes:
  • the network quality index includes network speed, packet loss rate and delay
  • the virtual image video call method wherein, when it is detected that the network quality index of the video call meets a predetermined condition, the step of controlling the acquisition of the pre-generated virtual image of the calling user includes:
  • the avatar video call method wherein the step of switching the current video call interface to display the avatar mapped with user behavior includes:
  • the virtual image video call method wherein the step of detecting the network quality index in the video call process includes:
  • the preset conditions are set by the user according to usage habits and usage experience.
  • the virtual image video call method wherein, before the step of controlling the acquisition of the pre-generated virtual image of the calling user, includes:
  • the avatar video call method further includes: if the call user does not have a corresponding preset avatar, collecting the appearance information of the call user according to the terminal camera and generating an algorithm containing the call user's appearance information.
  • the virtual image of the appearance characteristics of the user is stored, so that the calling user can call the corresponding preset virtual image when using the video call again later.
  • the avatar video call method wherein, after the step of switching the current video call interface to display the avatar mapped with user behavior, it includes:
  • a virtual image video call switching method wherein the method includes:
  • the avatar video call switching method wherein, before the step of detecting receiving or sending a video call request includes:
  • Pre-set the avatar for video calls or pre-set the algorithm to automatically generate the avatar according to the user's appearance characteristics.
  • the method for switching avatar video calls wherein the step of detecting that the video call is established successfully, and displaying the avatar mapped with user behavior on the video call screen includes:
  • Control the sound sensor to collect the user's voice, and recognize the mouth shape corresponding to the user's voice and the user's emotion when speaking through the voice recognition algorithm;
  • a terminal device wherein the terminal device includes a memory, a processor, and an avatar video call program stored on the memory and operable on the processor, and the processor executes the avatar video call During the program, the steps of any one of the avatar video calling methods are realized.
  • a computer-readable storage medium wherein an avatar video call program is stored thereon, and when the avatar video call program is executed by a processor, the steps of any one of the avatar video call methods are realized.
  • the present invention compares the network quality index with the predetermined condition of the preset network quality index according to the network quality index in the video call obtained by automatic detection, and when the comparison result shows that the current network quality is poor , switch the video call interface to an interface with an avatar displayed.
  • the interface displayed by the avatar can map the user's expressions, mouth shapes, body movements and other behaviors to the avatar, and it has a high image quality while maintaining a low data transmission volume, so that people can use it even when the network is poor. You can also get a clear and high-quality video call experience under the phone.
  • Fig. 1 is a flow chart of a specific implementation of the method for avatar video calling provided by the embodiment of the present invention.
  • Fig. 2 is a flow chart of a specific implementation of the method for switching an avatar video call provided by an embodiment of the present invention.
  • Fig. 3 is a schematic flowchart of a third embodiment of the present invention.
  • Fig. 4 is a schematic diagram of an internal structure of a terminal device provided by an embodiment of the present invention.
  • the directional indication is only used to explain the position in a certain posture (as shown in the accompanying drawing). If the specific posture changes, the directional indication will also change accordingly.
  • the embodiment of the present invention provides an avatar video call method.
  • the avatar video call method provided in this embodiment, when the network quality is detected to be poor during the video call, the avatar will be automatically called , and switch the current video call interface to an avatar for display; the behavior of the avatar is mapped to the current behavior of the user collected by the sensor of the mobile terminal. Because the transmission data stream required by the avatar is small, when the user is in a location with a poor network environment, he can also perform high-quality, high-frame-rate video calls through the avatar to obtain a better video call experience. Moreover, the setting of the avatar varies from person to person, and can be a pre-set template avatar, or a pre-generated avatar containing user characteristics.
  • an embodiment of the present invention provides an avatar video call method, which can be used in a mobile terminal.
  • the method described in the embodiment of the present invention includes the following steps:
  • Step S100 detecting network quality indicators during the video call
  • the detection method includes real-time detection and detection at fixed time intervals.
  • step of detecting the network quality in the video call process includes:
  • the predetermined condition of the network quality index is set in advance, and when it is detected that the network state during the video call meets the predetermined condition, then the video call interface is controlled to be switched to the virtual image for display;
  • the avatar used for the video call is preset to be used when the network quality index meets the predetermined condition.
  • the predetermined condition of the network quality index is set in advance, and the predetermined condition of the network quality index represents that when the parameters of the current network quality index meet the predetermined condition, it is considered that the current network quality is poor, and then the currently displayed video call interface is controlled to be switched
  • the network quality index is a parameter used to reflect the current network stability and network signal strength, including but not limited to packet loss rate, delay, network speed and other parameters.
  • the predetermined condition is to judge whether the current network speed is lower than the first value and/or whether the current packet loss rate is higher than the second value and/or detect whether the current delay is higher than the third value.
  • the first value, the second value and the third value are preset by the manufacturer or obtained through the user's own setting. When any one or a combination of any of them is satisfied, it is judged that the network quality is poor.
  • the setting of the avatar can be provided by big data, so that the user pre-selects and stores the avatar he likes, and when it is detected that the network quality index meets the predetermined condition, the pre-selected avatar is directly called for display, or it can be displayed according to the appearance of the user.
  • Features Automatically generate a unique avatar and set it up for use.
  • the mobile phone A of user A presets the parameters used to indicate the network quality index, including network speed, packet loss rate and network delay, and it is set that when the network speed in the network quality index parameter is lower than 0.5MB/s , the packet loss rate is greater than 30%, and the network delay is greater than 200ms.
  • the mobile phone A of user A collects body appearance information through the camera of mobile phone A, including facial features, head size, neck and shoulders Size and other information, and automatically generate a virtual image with user appearance characteristics through the algorithm and pre-store it in mobile phone A, and set the priority to switch the virtual image for display when the network quality is detected to be poor during the video call.
  • mobile phone A detects the current network speed, packet loss rate and network delay in real time. For example, one of the detections shows that the network speed is 5MB/s and the packet loss rate is 5 %, the network delay is 50ms.
  • the mobile phone In order to further save the power consumption of the mobile phone and the accuracy of the network quality judgment, it can be set to detect the network quality index at a certain interval, and the interval time can be set by the user. Because each user has a certain awareness of the performance and signal strength of his mobile phone during use, and the network of some mobile phones will only deteriorate at a certain moment or in a short period of time, combined with the user's habits , the user may not think that the network fluctuation problem affects the normal use of mobile phones or video calls, so set the time interval for detecting network quality indicators to be longer; The network in the network is worse than that of other models of mobile phones, so as long as the environmental network fluctuates slightly, the network will deteriorate for a long time, so the user can set the network quality index acquisition interval to be shorter. , in order to promptly switch the video call interface to an avatar for display at the initial moment of network deterioration.
  • the preset conditions of the network quality index used to judge the network quality can also be adjusted by the user according to his own usage habits and experience. For example, in the process of using a video call, if you don’t feel that the video call interface is stuck or the image quality is seriously lowered, then you suddenly switch to an avatar, which means that the predetermined conditions are set incorrectly at this time, and the user can freely set the network through the corresponding setting interface. Speed, packet loss rate, network delay or other predetermined conditions of multiple parameters, so as to ensure that the video call world is switched only when the user needs to switch.
  • step S200 judging whether the network quality index of the current video call satisfies a predetermined condition
  • the mobile terminal compares the obtained network quality index with the preset predetermined condition of the network quality index, and judges whether the network quality index of the current video call satisfies the predetermined condition.
  • the predetermined conditions of the network quality index preset by user A's mobile phone A are that the network speed is lower than 0.5MB/s, the packet loss rate is greater than 30%, and the network delay is greater than 200ms, and any one of the network quality indicators is set
  • the condition is met, it means that the network quality index of the video call meets the predetermined condition.
  • the network speed is 5MB/s
  • the packet loss rate is 5%
  • the network delay is 50ms
  • the network speed, packet loss rate, and network delay do not meet the preset requirements.
  • the predetermined condition of the network quality index is that the network quality at this moment is normal; when it is detected at another moment that the network speed is 0.2MB/s, the packet loss rate is 16%, and the network delay is 105ms, then the network quality index in the network quality index is If one item of the speed meets the predetermined condition, it is judged that the current network quality is poor.
  • step S300 when it is detected that the network quality index of the video call satisfies a predetermined condition, control to acquire the pre-generated avatar of the calling user;
  • the network quality index meets a predetermined condition, that is, when it is detected that the current network speed is lower than the first value and/or the current packet loss rate is higher than the second value and/or the current delay is detected
  • the value is higher than the third value, the avatar of the calling user that is generated in advance and stored in the storage space of the mobile terminal is invoked.
  • the avatar representing the calling user that is generated in advance and stored in the storage space of the mobile terminal is invoked.
  • the pre-generated avatars include template avatars obtained by users from big data, such as male character templates, female character templates, or other interesting avatars including kittens, puppies, etc.; Collect user appearance information and generate a virtual image containing user appearance characteristics through algorithms.
  • the user appearance includes information such as the user's head shape, facial features, and body shape.
  • the face of the caller who is currently using the video call is recognized to check whether the user has preset in the phone. If there is a corresponding avatar, the preset avatar is called directly, and if there is no avatar, a template avatar is allocated for its use. Or further, based on the user's appearance information collected by the mobile phone camera, an algorithm is used to temporarily generate a virtual image containing the user's appearance characteristics, and store it in the mobile phone, so that the user can call his own virtual image when using the mobile phone again.
  • step S400 the current video call interface is switched to display the avatar mapped with user behavior.
  • the video call interface is switched to the avatar for display.
  • the user's behavior is collected through the mobile terminal camera and/or sound sensor, and the behavior includes facial expressions, body movements, shaking and turning of the head, and mouth movements obtained through voice recognition and analysis. And the user behavior is mapped to the avatar for display.
  • the screen transmitted by user A's mobile phone A to the peer end of the video call is switched to a screen containing the avatar.
  • the mobile phone A of the user A collects user behaviors through the front camera of the mobile phone A, specifically identifying the contours of the user's face and the contours of the nose, mouth, glasses, and eyebrows through an algorithm, and detecting changes in the area and shape of the contours.
  • the user closes the eyes, opens the mouth and other behaviors, and judges whether the user's head turns according to the shape and area of the user's face, and maps the collected and recognized user's behaviors to the virtual image, so that the virtual The behavior of the image is consistent with the user's actions, improving the call experience of the users on both sides of the call.
  • the method of collecting user behavior by mobile phones can be obtained only based on the sound.
  • the implementation method is to recognize the conversation content and tone of the user through the acquired audio, and further convert the conversation content and tone into mouth shapes and facial expressions Emoticons are mapped to the avatar for display.
  • the user behavior obtained by this method greatly reduces the processing performance requirements of the avatar video call on the mobile phone, and ensures the experience of both parties in the call on the basis of the loss of part of the avatar's mobility.
  • Users can also customize the user behavior collection process according to the performance of their mobile phones, including only collecting head rotation and shaking, and only collecting any one or more of eyes, eyebrows, nose, and mouth for mapping, maximizing Maintain the mobility of the avatar and the smoothness of the video call.
  • the step of switching the current video call interface to display the avatar mapped with the user behavior includes:
  • the detection method includes real-time detection and fixed time detection.
  • the former can switch the video call interface in time according to the user's network quality index, and the latter can reduce the demand for processing power of the processor and reduce energy consumption.
  • the predetermined condition is not met, it is determined that the current network quality is normal.
  • the packet loss rate is 10%
  • the network delay is 99ms during the video call using the avatar, no network quality index meets the predetermined conditions from this moment until 30s , it is judged that the network quality has returned to normal, and mobile phone A switches the avatar display to a normal video call interface.
  • the mobile terminal detects the network quality during the user's video call, and automatically switches the video call interface to an avatar interface for display when it is judged that the network quality is poor, and uses the mobile terminal camera and/or Or the sound sensor maps the user's behavior to the avatar in real time for display.
  • the user can still maintain clarity and smoothness even when the network quality is poor. call screen.
  • a virtual image video call switching method including the following steps:
  • Step A100 when receiving or sending a video call request is detected, detecting whether an instruction to use an avatar to make a video call is received;
  • the mobile terminal when the mobile terminal detects that a video call request is received or sent, it detects whether the user needs to use the avatar to make a video call.
  • Preset avatars for video calls or preset algorithms for automatically generating avatars based on user characteristics.
  • the mobile phone B detects whether it needs to use the avatar to make a video call, that is, detects whether an instruction to use the avatar to make a video call is received.
  • the detection method includes displaying an interactive window on the display screen of the mobile phone B whether to use the avatar to make a video call, or detecting whether the user B presets to use the avatar to make a video call by default.
  • preset avatars for video calls which can be downloaded and stored through big data, including male character templates, female character templates or Some interesting virtual images such as kittens, puppies, and even some non-biological images such as Coke and boxes with eyes and mouths.
  • the algorithm needs to cooperate with the camera of the mobile terminal to collect the user's appearance characteristics to generate avatars. Users can pre-generate and store avatars with their own appearance characteristics, or it can be During the virtual image video call using the mobile terminal, the virtual image containing appearance features is generated and displayed in real time.
  • step A200 when it is detected that an instruction to make a video call using the avatar is received, call or generate the avatar, and map the user's behavior to the avatar;
  • the avatar including the appearance characteristics of the user is called or generated, and the behavior of the user is mapped to the avatar.
  • the control calls the pre-selected and stored in the The avatar in the mobile phone B, or the avatar with the appearance characteristics of the user is generated through a pre-stored algorithm. Perform the steps in Embodiment 1 to map the user's behavior into the avatar.
  • the user can choose whether to use the avatar for the video call before the start of the video call, and the use of various styles of avatars makes the video call interesting, and according to the user's needs, the video call can be kept confidential. .
  • step S300 it is detected that the video call is established successfully, and the avatar mapped with user behavior is displayed on the video call screen.
  • the video or avatar sent by the peer user is received, and the local user also calls the avatar mapped with user behavior and displays it on the peer video call screen.
  • the user not only can the user set up and disable the avatar, but also apply for the other party to use the avatar to interact.
  • the avatar is clearer and smoother than the video method and saves network traffic.
  • a mobile terminal is taken as an example of a mobile phone, as shown in FIG. 3
  • a virtual image video call method in this specific application embodiment includes the following steps:
  • Step S10 start, enter step S11;
  • Step S11 the operator's business hall collects or re-records the portrait when opening the card, and proceeds to step S12;
  • Step S12 the operator uses the collected portraits to generate a virtual portrait with user appearance characteristics through an algorithm, and enters step S13;
  • Step S13 the operator stores the avatar into the SIM card or binds the avatar to the SIM card through the network server, and enters step S14;
  • Step S14 when it is detected that the user uses the SIM card to make a video call, according to whether the user needs to use the avatar, transmit the avatar preset in the SIM card to the opposite user, and enter step S15;
  • Step S15 the peer end of the video call receives the avatar data, and proceeds to step S16 and step S17;
  • Step S16 display the avatar received through the network, and proceed to step S20;
  • Step S17 according to the requirements of the opposite end of the video call, the video call number or its avatar can be stored and remarked;
  • the user can collect personal appearance characteristics in the business hall of the operator and generate an avatar with the same appearance characteristics as the user and establish a connection with the SIM card, and the virtual image can be used during a video call.
  • the avatar is transmitted to the display screen of the opposite end, which plays a role of privacy and confidentiality, and the avatar can be stored and noted like a name and a phone number.
  • User C activates a SIM card in the operator's business hall, and the operator's staff collects user C's appearance information, which includes at least the size of the head and facial features, and automatically generates a profile containing user C's appearance information based on the collected appearance information of user C. virtual image, and bind the SIM card and the virtual image by storing the virtual image of the user C into the SIM card or through a network server.
  • the mobile phone can obtain the avatar through the SIM card in addition to making calls through the SIM card.
  • the avatar can be freely switched for display according to the needs of user C.
  • the current network is not good enough to carry out high-definition video call transmission, then you can switch to the one with lower network speed requirements
  • the avatar continues to make a video call, or when user C receives a video call request from an unfamiliar person or wants to make an interesting video call request with a friend, the video call is made by manually switching to use the avatar to answer the call. call.
  • the peer end of the video call receives the avatar transmitted by the Internet, it will display the avatar of user C on the display of the terminal device, and further store the avatar of user C in the same way as storing a mobile phone number, and edit .
  • each user can have a unique avatar, and can freely choose to use video calls and avatar calls according to user needs such as poor network or privacy considerations, thereby improving user experience and security.
  • the present invention further provides a terminal device, the functional block diagram of which may be shown in FIG. 4 .
  • the terminal equipment includes a processor, a memory, a network interface, and a display screen connected through a system bus.
  • the processor of the terminal device is used to provide calculation and control capabilities.
  • the memory of the terminal device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system and computer programs.
  • the internal memory provides an environment for the operation of the operating system and computer programs in the non-volatile storage medium.
  • the network interface of the terminal device is used to communicate with external terminals through a network connection. When the computer program is executed by the processor, a virtual image video call is realized.
  • the display screen of the terminal device may be a liquid crystal display screen or an electronic ink display screen.
  • a terminal device includes a memory, a processor, and an avatar video call program stored on the processor and operable on the processor.
  • the processor performs the following steps:
  • the current video call interface is switched to the avatar mapped with user behavior for display.
  • step of detecting the network quality index in the video call process includes:
  • the predetermined condition of the network quality index is set in advance, and when it is detected that the network state during the video call meets the predetermined condition, then the video call interface is controlled to be switched to the virtual image for display;
  • the avatar used for the video call is preset to be used when the network quality index meets the predetermined condition.
  • the step of judging whether the network quality index of the current video call satisfies a predetermined condition includes:
  • the network quality index includes network speed, packet loss rate and delay
  • the step of controlling the acquisition of the pre-generated avatar of the calling user includes:
  • the step of switching the current video call interface to display the avatar mapped with user behavior includes:
  • the step of detecting the network quality index in the video call process includes:
  • the preset condition is set by the user according to usage habits and usage experience.
  • step of controlling the acquisition of the pre-generated avatar of the calling user includes:
  • the appearance information of the calling user is collected according to the terminal camera, and an avatar containing the appearance characteristics of the calling user is generated by an algorithm and stored, so that the The calling user can invoke the corresponding preset avatar when using the video call again subsequently.
  • step of switching the current video call interface to display the avatar mapped with the user behavior includes:
  • Nonvolatile memory can include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory can include random access memory (RAM) or external cache memory.
  • RAM is available in many forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Chain Synchlink DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
  • SRAM Static RAM
  • DRAM Dynamic RAM
  • SDRAM Synchronous DRAM
  • DDRSDRAM Double Data Rate SDRAM
  • ESDRAM Enhanced SDRAM
  • SLDRAM Synchronous Chain Synchlink DRAM
  • Rambus direct RAM
  • DRAM direct memory bus dynamic RAM
  • RDRAM memory bus dynamic RAM
  • a virtual image video call method, terminal equipment and storage medium including: detecting network quality indicators during the video call process; judging whether the network quality indicators of the current video call meet predetermined conditions; When the network quality index of the call satisfies the predetermined condition, control to acquire the pre-generated avatar of the call user; switch the current video call interface to the avatar mapped with user behavior for display. It aims to solve the problem that when the user is using a mobile terminal to make a video call, due to the deterioration of the network quality, the video call can only be maintained by reducing the image quality, resulting in a poor user experience.
  • the video call interface When it is detected that the network quality of the user is deteriorating during the video call, the video call interface is automatically switched to the preset avatar for display, and the high-definition and fluency display effect can be maintained even in the case of a poor network, which improves the user experience. User experience.
  • Pre-set multiple NFC landlines that support NFC functions and have NFC tags for placement on different workstations pre-set an NFC wearable device for each user, and establish each user's NFC wearable device with the number of the NFC landline Correspondence; when there is an incoming call, find the NFC wearable device corresponding to the extension number of the incoming call; through the NFC wearable device corresponding to the extension number of the incoming call, obtain the NFC landline at the nearest station to the NFC wearable device; control The incoming call is switched to the NFC wearable device corresponding to the extension number of the incoming call, and the NFC landline at the nearest station rings.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • General Health & Medical Sciences (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Psychiatry (AREA)
  • Social Psychology (AREA)
  • Computer Graphics (AREA)
  • Telephone Function (AREA)

Abstract

本发明提供一种虚拟形象视频通话方法,包括:对视频通话过程中的网络质量指标进行检测;判断当前视频通话的网络质量指标是否满足预定条件;当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。

Description

一种虚拟形象视频通话方法、终端设备及存储介质
本申请要求于2021年08月09日提交中国专利局、申请号为202110908844.6、申请名称为“一种虚拟形象视频通话方法、终端设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明涉及移动终端领域,尤其涉及的是一种基于虚拟形象视频通话方法、终端设备及存储介质。
背景技术
传统的视频通话中,当网络信号较差或网络信号波动较大时会导致用户的视频通话卡顿,影响使用体验。
技术问题
传统的解决方法多为对人脸进行剪切、降低码率以及进行图像差分的方法来降低传输数据流,但降低传输数据流则代表传输画面有缺失,不能传输清晰、流畅的画面,不能满足用户对视频通话流畅性和清晰度的需求。
因此,现有技术还有待改进和发展。
技术解决方案
本发明要解决的技术问题在于,针对现有技术的上述缺陷,提供一种虚拟形象视频通话方法、装置、终端设备及存储介质。
为了解决上述技术问题,本发明采用的技术方案如下:
一种虚拟形象视频通话方法,其中,所述方法包括:
对视频通话过程中的网络质量指标进行检测;
判断当前视频通话的网络质量指标是否满足预定条件;
当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;
将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。
所述的虚拟形象视频通话方法,其中,所述对视频通话过程中的网络质量指标进行检测的步骤之前包括:
预先设置网络质量指标的预定条件,当检测到正在视频通话时的网络状态满足预定条件,则控制将视频通话界面切换为所述虚拟形象进行显示;
预先设置当网络质量指标满足预定条件时使用的,用于视频通话的虚拟形象。
所述的虚拟形象视频通话方法,其中,所述判断当前视频通话的网络质量指标是否满足预定条件的步骤包括:
所述网络质量指标包括网速、丢包率以及延迟;
检测当前网速是否低于第一数值和/或当前丢包率是否高于第二数值和/或检测当前延迟是否高于第三数值。
所述的虚拟形象视频通话方法,其中,所述当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象的步骤包括:
当检测到当前网速低于第一数值和/或当前丢包率高于第二数值和/或检测当前延迟高于第三数值时;
控制调用预先生成的通话用户的虚拟形象。
所述的虚拟形象视频通话方法,其中,所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤包括:
将当前视频通话界面切换为虚拟形象进行显示;
并通过移动终端摄像头和/或声音传感器采集用户行为;
将用户行为映射到虚拟形象中并进行显示。
所述的虚拟形象视频通话方法,其中,所述对视频通话过程中的网络质量指标进行检测的步骤包括:
对视频通话过程中的网络质量指标进行实时检测或间隔固定时间进行检测。
所述的虚拟形象视频通话方法,其中,所述预设条件由用户根据使用习惯以及使用感受进行设置。
所述的虚拟形象视频通话方法,其中,所述控制获取预先生成的通话用户的虚拟形象的步骤之前包括:
对当前使用视频通话的通话用户人脸进行识别,检查所述通话用户是否有对应的预设虚拟形象,若有则直接调用所述预设虚拟形象,若无则分配模板虚拟形象用于视频通话。
所述的虚拟形象视频通话方法,其中,所述方法还包括:若所述通话用户没有对应的预设虚拟形象,则根据终端摄像头采集所述通话用户的外观信息并通过算法生成包含所述通话用户的外观特征的虚拟形象,并进行存储,使得所述通话用户后续再次使用视频通话时能够调用对应的预设虚拟形象。
所述的虚拟形象视频通话方法,其中,所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤之后包括:
检测所述虚拟形象通话过程中的网络质量指标,当所述网络质量指标在预设的时间内持续均回到正常值,则将虚拟形象显示切换为视频通话界面。
一种虚拟形象视频通话切换方法,其中,所述方法包括:
当监测到接收或发出视频通话请求,检测是否接收到使用虚拟形象进行视频通话指令;
当检测接收到使用虚拟形象进行视频通话指令时,调用或生成虚拟形象,并将用户的行为映射到虚拟形象中;
检测到视频通话建立成功,将映射有用户行为的虚拟形象显示在视频通话画面中。
所述的虚拟形象视频通话切换方法,其中,所述当监测到接收或发出视频通话请求的步骤之前包括:
预先设置用于视频通话的虚拟形象,或预先设置根据用户外观特征自动生成虚拟形象的算法。
所述的虚拟形象视频通话切换方法,其中,所述检测到视频通话建立成功,将映射有用户行为的虚拟形象显示在视频通话画面中的步骤包括:
检测到视频通话建立成功后;
控制声音传感器采集用户声音,并通过语音识别算法识别用户声音所对应的口型以及用户说话时的情感;
控制将所述识别到的口型与情感映射为虚拟形象面部动作进行显示。
一种终端设备,其中,所述终端设备包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的虚拟形象视频通话程序,所述处理器执行所述虚拟形象视频通话程序时,实现任一项所述的虚拟形象视频通话方法的步骤。
一种计算机可读存储介质,其中,其上存储有虚拟形象视频通话程序,所述虚拟形象视频通话程序被处理器执行时,实现任一项所述的虚拟形象视频通话方法的步骤。
有益效果
与现有技术相比,本发明根据自动检测得到的视频通话中的网络质量指标,将所述网络质量指标与预先设置的网络质量指标的预定条件做对比,当对比结果显示当前网络质量较差时,将视频通话界面切换为有虚拟形象显示的界面。所述虚拟形象显示的界面可将用户的表情、口型、肢体动作等行为映射到虚拟形象上,并且在保持低数据传输量的同时拥有较高的画质,使人们在网络较差的情况下也能得到清晰、高品质的视频通话体验。
附图说明
下为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明中记载的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是本发明实施例提供的虚拟形象视频通话方法的具体实施方式的流程图。
图2是本发明实施例提供的虚拟形象视频通话切换方法的具体实施方式的流程图。
图3是本发明第三实施例的流程示意图。
图4是本发明实施例提供的终端设备的内部结构原理图。
本发明的实施方式
为使本发明的目的、技术方案及优点更加清楚、明确,以下参照附图并举实施例对本发明进一步详细说明。应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。
需要说明,若本发明实施例中有涉及方向性指示(诸如上、下、左、右、前、后……),则该方向性指示仅用于解释在某一特定姿态(如附图所示)下各部件之间的相对位置关系、运动情况等,如果该特定姿态发生改变时,则该方向性指示也相应地随之改变。
另外,若本发明实施例中有涉及“第一”、“第二”等的描述,则该“第一”、“第二”等的描述仅用于描述目的,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本发明要求的保护范围之内。
随着科技的发展和人们生活水平的不断提高,人们对于画面显示的要求越来越高的同时,也已经习惯了高清、流畅的画质带来的视觉体验。但是在使用移动终端进行视频通话的时候,其通话画质却因为网络水平或移动网络费用依旧被限制在720P及以下的水准,甚至是在一些户外移动网络信号覆盖较差的位置进行视频通话时其画质会变得更差。例如因网速到不到要求时,用于视频通话的软件为了保证视频通话的正常进行,会将传输画面进行截取并放大,即原本视频通话的对端用户可以完整的看到上半身的画面经过放大后仅能看到肩膀以及放大后的脑袋,或者降低传输码率或者帧率对视频画面的数据进行缩减,人们无法以观看视频的流畅度和清晰度进行视频通话。
为了解决上述问题,本发明实施例提供一种虚拟形象视频通话方法,根据本实施例提供的虚拟形象视频通话方法,当在视频通话的过程中检测到网络质量较差,则会自动调用虚拟形象,并将当前视频通话界面切换为虚拟形象进行显示;所述虚拟形象的行为动作则根据移动终端传感器采集用户当前的行为动作映射得到。因为虚拟形象所需的传输数据流较小,所以当用户处在网络环境较差的位置时也能通过虚拟形象进行高画质、高帧率的视频通话,获得更好的视频通话体验。而且其虚拟形象的设置因人而异,可以是预先设置的模板虚拟形象,也可以是预先生成的包含用户特征的虚拟形象。
示例性方法
第一实施例
如图1中所示,本发明实施例提供一种虚拟形象视频通话方法,所述虚拟形象视频通话方法可用于移动终端。在本发明实施例中所述方法包括如下步骤:
步骤S100、对视频通话过程中的网络质量指标进行检测;
在本实施例中,当用户在视频通话过程中网络质量较差时会导致视频通话画质变差,则设置在视频通话的过程中持续对网络质量指标进行检测,所述表示网络质量指标的参数包括丢包率、延迟、网速等参数。所述检测方式包括实时检测以及间隔固定时间进行检测。
其中,在所述对视频通话过程中的网络质量进行检测的步骤之前包括:
预先设置网络质量指标的预定条件,当检测到正在视频通话时的网络状态满足预定条件,则控制将视频通话界面切换为所述虚拟形象进行显示;
预先设置当网络质量指标满足预定条件时使用的,用于视频通话的虚拟形象。
预先设置网络质量指标的预定条件,所述网络质量指标的预定条件代表,当当前的网络质量指标的参数满足预定条件时,认为当前网络质量较差,则控制将当前进行显示的视频通话界面切换为对网络质量依赖较小的虚拟形象进行显示,所述网络质量指标为用于反映当前网络稳定性以及网络信号强度的参数,包括但不限于丢包率、延迟、网速等参数。
所述预定条件为判断当前网速是否低于第一数值和/或当前丢包率是否高于第二数值和/或检测当前延迟是否高于第三数值。其中第一数值、第二数值以及第三数值分别为厂商出厂预设或通过用户自己设置得到,当其中任一项或任意多项的组合满足时则判断网络质量较差。
预先设置用于当网络质量较差时使用的,用于代替视频通话界面虚拟形象。所述虚拟形象的设置可由大数据进行提供,使用户预先选择并存储自己喜欢的虚拟形象,当检测到网络质量指标满足预定条件时直接调用该预先选择的虚拟形象进行显示,或根据用户的外观特征自动生成独一无二的虚拟形象并进行使用设置。
举例说明,用户A的手机A中预先设置了用于表示网络质量指标的参数,包括网速、丢包率以及网络延迟,并设定当网络质量指标参数中的网速低于0.5MB/s、丢包率大于30%以及网络延迟大于200ms中的任意一条件被满足时,判断当前网络质量较差;用户A通过手机A摄像头采集身体的外观信息包括五官、头部大小、脖子以及肩膀的大小等信息,并通过算法自动生成带有用户外观特点的虚拟形象预先存储在手机A中,并设置当视频通话过程中检测到网络质量较差时优先切换该虚拟形象进行显示。
预设完毕后,当用户A与用户B处于视频通话过程中,手机A实时检测当前的网速、丢包率以及网络延迟,例如其中一次检测得到网速为5MB/s、丢包率为5%、网络延迟为50ms。
为进一步节省手机功耗以及网络质量判断的准确性可以设置每间隔一定时间对网络质量指标进行一次检测,其间隔时间可由用户设置。因为每个用户对自己手机在使用过程中的性能以及信号强度都有一定的认知,有的手机的网络仅仅是某个时刻或某个短暂的时间内会变差,结合用户的习惯而言,用户有可能不认为该网络波动问题影响到手机或视频通话的正常使用,则将检测网络质量指标的时间间隔设置的较长一些;或有的用户因为其手机款式的原因导致手机在使用过程中的网络比其他型号的手机网络都要差,所以只要环境网络稍有波动就会带来较长时间内的网络变差的问题,则该用户可对应将网络质量指标的获取间隔设置较短,以便在网络变差的初始时刻就及时的将视频通话界面切换为虚拟形象进行显示。
不仅仅是网络质量指标的获取可以由用户设定,用于判断网络质量的网络质量指标的预定条件也可以让用户根据自身使用习惯以及使用感受做相应调整。例如在视频通话使用过程中并未感觉到视频通话界面卡顿以及画质严重变低就突然切换为虚拟形象,则代表此时的预定条件设置有误,用户通过对应的设置界面可自由设置网速、丢包率、网络延迟或其他的多个参数的预定条件,以保证在用户需要进行切换的情况下才对视频通话界进行切换。
进一步地,步骤S200、判断当前视频通话的网络质量指标是否满足预定条件;
在本实施例中,当获取到网络质量指标时,移动终端将获取的网络质量指标与预先设置的网络质量指标的预定条件做对比,判断当前视频通话的网络质量指标是否满足预定条件。
举例说明,当用户A的手机A预设的网络质量指标的预定条件为网速低于0.5MB/s、丢包率大于30%以及网络延迟大于200ms,且设置所述网络质量指标中任意一条件被满足时表示视频通话的网络质量指标满足预定条件。
例如用户A在视频通话过程中的某一时刻中检测得到网速为5MB/s、丢包率为5%以及网络延迟为50ms,则网速、丢包率、网络延迟均未满足预设的网络质量指标的预定条件,认为此时刻的网络质量正常;当在另一时刻中检测得到网速为0.2MB/s、丢包率为16%以及网络延迟为105ms,则网络质量指标中的网速一项满足了所述预定条件,则判断当前网络质量较差。
进一步地,步骤S300、当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;
在本实施例中,当检测到视频通话时的网络质量指标满足预定条件,即当检测到当前网速低于第一数值和/或当前丢包率高于第二数值和/或检测当前延迟高于第三数值时,则调用预先生成并存储在移动终端存储空间中的通话用户的虚拟形象。
举例说明,当用户A在视频通话过程中检测到网络质量指标满足预定条件,网络质量较差时,调用预先生成并存储在移动终端存储空间中的代表通话用户的虚拟形象。
所述预先生成的虚拟形象包括用户从大数据中获取的模板虚拟形象,例如男性人物模板、女性人物模板或者其他有趣味性的虚拟形象包括小猫、小狗等虚拟形象;或是根据手机摄像头采集用户外观信息并通过算法生成的含有用户外观特征的虚拟形象,其用户外观包括用户的头型、五官以及身材等信息。
为进一步加强不同用户在使用虚拟形象进行通话时的区分度,在获取通话用户的虚拟形象的步骤之前对当前使用视频通话的通话用户的人脸进行识别,检查该用户是否有在手机内预设对应的虚拟形象,若有则直接调用所述预设的虚拟形象,若无则分配模板虚拟形象供其使用。或进一步的根据手机摄像头采集的用户的外观信息并通过算法临时生成包含该用户外观特征的虚拟形象,并将其存储在手机中,使得该用户后续再次使用该手机时能够调用自己的虚拟形象。
进一步地,步骤S400、将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。
在本实施例中,当控制调用预先生成的通话用户的虚拟形象后,将视频通话界面切换为该虚拟形象进行显示。并通过移动终端摄像头和/或声音传感器对用户的行为进行采集,所述行为包括面部表情、肢体动作,头部的晃动及转动以及通过声音识别分析得到的口部动作。并将所述用户行为映射到虚拟形象中进行显示。
举例说明,当用户A的手机A调用预先生成的带有用户A的外观特征的虚拟形象后,用户A的手机A传输给视频通话对端的画面切换为含有虚拟形象的画面。同时所述用户A的手机A通过手机A的前置摄像头采集用户行为,具体为通过算法识别用户的脸的轮廓以及鼻子、嘴巴、眼镜、眉毛的轮廓,通过所述轮廓的面积以及形状变化检测用户是否闭上眼睛,打开嘴巴等行为动作,以及通过用户脸型的形状以及面积变化判断用户头部是否进行转动,并将所采集识别到的用户的行为动作映射到所述虚拟形象中,使虚拟形象的行为动作与用户的动作保持一致,提高通话双方用户的通话体验。
考虑到手机性能有好有坏,手机采集用户行为的方法可仅根据声音获得,其实现方法为通过获取的用户音频识别其对话内容以及语气,并进一步将对话内容以及语气转化为口型以及面部表情并映射到虚拟形象中进行显示。通过此方法获取的用户行为大幅度降低虚拟形象视频通话对手机处理性能的要求,在损失部分虚拟形象可动性的基础上保证了通话双方的使用体验。
用户还可以根据自身手机的性能对用户行为采集过程进行自定义设置,包括仅采集头部转动和晃动,仅采集眼部、眉毛、鼻子、嘴巴中任意一项或多项进行映射,最大程度的保有虚拟形象可动性以及视频通话的流畅性。
进一步地,在所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤之后包括:
检测虚拟形象通话过程中的网络质量指标,当用于检测网络质量的网络质量指标在预设的时间内持续均回到正常值,则将虚拟形象显示切换为视频通话界面。所述检测方式包括实时检测以及固定时间进行检测,前者可及时根据用户网络质量指标进行视频通话界面的切换,后者可降低处理器处理运算能力需求,降低能耗。
举例说明,预先设置当网络质量指标持续30s中内持续回到正常值,即未满足预定条件,则判断当前网络质量正常。当用户A在使用虚拟形象进行视频通话的过程中检测到网速为1.5MB/s、丢包率10%、网络延迟99ms,则从此刻开始计算直到30s内无任一网络质量指标满足预定条件,则判断网络质量恢复正常,手机A将虚拟形象显示切换为正常的视频通话界面。
通过以上实施例,使移动终端在用户进行视频通话的过程中检测网络质量,并在判断网络质量较差的情况下自动将视频通话界面切换为虚拟形象界面进行显示,并通过移动终端摄像头和/或声音传感器将用户的行为动作实时映射到虚拟形象中进行显示,通过使用数据流需求较小的虚拟形象代替视频通话界面的方法,使用户在网络质量较差的情况下依然能够保持清晰、流畅的通话画面。
第二实施例
如图2所示,在第二实施例中提供一种虚拟形象视频通话切换方法,包括如下步骤:
步骤A100、当监测到接收或发出视频通话请求,检测是否接收到使用虚拟形象进行视频通话指令;
在本实施例中,当移动终端监测到接收或发出视频通话请求,检测用户是否需要使用虚拟形象进行视频通话。
其中,在所述当监测到接收或发出视频通话请求的步骤之前包括:
预先设置用于视频通话的虚拟形象,或预先设置根据用户特征自动生成虚拟形象的算法。
举例说明,当用户B通过手机B拨出或接收到来自其他用户的视频通话请求时,所述手机B检测是否需要使用虚拟形象进行视频通话,即检测是否接收到使用虚拟形象进行视频通话指令。所述检测的方法包括在手机B的显示屏中显示是否使用虚拟形象进行视频通话的交互窗口,或检测用户B是否预先设置默认使用虚拟形象进行视频通话。
在通过所述手机B拨出或接收来自其他用户的视频通话请求之前,预先设置用于视频通话的虚拟形象,其虚拟形象可通过大数据进行下载和存储,包括男性人物模板、女性人物模板或一些趣味性的虚拟形象例如小猫、小狗,甚至是一些原本就不是生物形象例如长了眼睛和嘴巴的可乐、盒子。或预先设置根据用户特征自动生成虚拟形象的算法,其算法需配合移动终端摄像头采集用户的外观特征进行虚拟形象的生成,用户可以预先生成带有自己外观特征的虚拟形象并进行存储,也可以是使用移动终端进行虚拟形象视频通话的过程中实时生成该含有外观特征的虚拟形象并显示。
进一步地,步骤A200、当检测接收到使用虚拟形象进行视频通话指令时,调用或生成虚拟形象,并将用户的行为映射到虚拟形象中;
在本实施例中,当检测到用户需要使用虚拟形象进行视频通话时,调用或生成包含用户外观特征的虚拟形象,并将用户的行为映射到虚拟形象中。
举例说明,当检测到用户B选择需要使用虚拟形象进行视频通话的操作指令,或用户B在手机B中预先设置有默认或有限使用虚拟形象进行视频通话设定,则控制调用预先选择并存储在手机B中的虚拟形象,或通过预先存储的算法生成拥有用户外观特征的虚拟形象。执行如实施例1中的步骤将用户的行为映射到虚拟形象中。通过该方法使得用户可以在视频通话开始前选择是否使用虚拟形象进行视频通话,并且使用多种样式的虚拟形象使得视频通话具有趣味性,且根据用户的使用需求可使视频通话具有保密性的特点。
进一步地,步骤S300、检测到视频通话建立成功,将映射有用户行为的虚拟形象显示在视频通话画面中。
在本实施例中,检测到视频通话建立成功后,接收由对端用户发送的视频或虚拟形象,而本端用户也将调用并且映射有用户行为的虚拟形象显示在对端的视频通话画面中。同时不仅由己方可以设置使用和关闭虚拟形象,还可申请让对方使用虚拟形象进行互动,其一可以增加视频通话的趣味性,其二虚拟形象比起视频的方式更加清晰流畅且节省网络流量。
第三实施例
以下通过一具体应用实施例对本发明方法做进一步详细说明:
本具体应用实施例,移动终端以手机为例,如图3所示,本具体应用实施例的一种虚拟形象视频通话方法,包括如下步骤:
步骤S10、开始,进入步骤S11;
步骤S11、运营商营业厅在开卡时对人像进行采集或补录人像,进入步骤S12;
步骤S12、运营商将采集到的人像通过算法生成拥有用户外观特征的虚拟人像,进入步骤S13;
步骤S13、运营商将虚拟人像存储进入SIM卡或通过网络服务器将虚拟人像与SIM卡进行绑定,进入步骤S14;
步骤S14、当检测到用户使用该SIM卡进行视频通话时,根据用户是否使用虚拟形象的需求,并将SIM卡中预设的虚拟形象传输至对端用户,进入步骤S15;
步骤S15、视频通话对端接收到虚拟形象数据,进入步骤S16与步骤S17;
步骤S16、将通过网络接收到的虚拟形象进行显示,进入步骤S20;
步骤S17、根据视频通话对端的需求,可将视频通话号码或其虚拟形象进行存储以及备注;
步骤S20、结束。
由上可见,本发明具体应用实施例中,用户可以通过在运营商营业厅中采集个人外观特征并生成与本人有相同外观特征的虚拟形象并与SIM卡建立联系,可以在视频通话时将虚拟形象传输至对端显示屏中,起到隐私保密的作用,且可以如同姓名与电话号码一样将虚拟形象的进行存储与备注。
用户C在运营商营业厅开通SIM卡,运营商工作人员采集用户C的外观信息,其中至少包括脑袋大小及五官特征,并且根据采集到的用户C的外观信息自动生成含有用户C的外观信息的虚拟形象,并通过将所述用户C的虚拟形象存储进入SIM卡或通过网络服务器的形式将SIM卡以及虚拟形象进行绑定。当用户C在手机C中插入该SIM卡后,手机除了能够通过该SIM卡拨打电话还能够通过该SIM卡获取到所述虚拟形象。当检测到用户C即将进行视频通话或处于视频通话中时,根据用户C的需求自由切换虚拟形象进行显示,例如当前网络不佳无法进行高清的视频通话传输,则可以切换网速需求较小的虚拟形象继续进行视频通话,或者当用户C接收到不熟悉的人的视频通话请求或与朋友之间想要进行趣味性的视频通话请求时,则通过手动切换使用虚拟形象进行接听的方式进行视频通话。当视频通话对端接收到由互联网传输的虚拟形象后,将所述用户C的虚拟形象显示在终端设备的显示器上,进一步地可以像存储手机号码一样将用户C的虚拟形象进行存储,并编辑。
通过上述实施例使每个用户可以拥有一个独一无二的虚拟形象,并且根据用户需求例如网络不佳或出于隐私考虑自由的选择使用视频通话以及虚拟形象通话,提高用户的使用体验以及安全性。
基于上述实施例,本发明还提供了一种终端设备,其原理框图可以如图4所示。该终端设备包括通过系统总线连接的处理器、存储器、网络接口、显示屏。其中,该终端设备的处理器用于提供计算和控制能力。该终端设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该终端设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种虚拟形象视频通话。该终端设备的显示屏可以是液晶显示屏或者电子墨水显示屏。
本领域技术人员可以理解,图4中示出的原理框图,仅仅是与本发明方案相关的部分结构的框图,并不构成对本发明方案所应用于其上的终端设备的限定,具体的终端设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。
在一个实施例中,提供了一种终端设备,终端设备包括存储器、处理器及存储在处理器上并可在处理器上运行的虚拟形象视频通话程序,处理器执行如下步骤:
对视频通话过程中的网络质量指标进行检测;
判断当前视频通话的网络质量指标是否满足预定条件;
当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;
将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。
其中,所述对视频通话过程中的网络质量指标进行检测的步骤之前包括:
预先设置网络质量指标的预定条件,当检测到正在视频通话时的网络状态满足预定条件,则控制将视频通话界面切换为所述虚拟形象进行显示;
预先设置当网络质量指标满足预定条件时使用的,用于视频通话的虚拟形象。
其中,所述判断当前视频通话的网络质量指标是否满足预定条件的步骤包括:
所述网络质量指标包括网速、丢包率以及延迟;
检测当前网速是否低于第一数值和/或当前丢包率是否高于第二数值和/或检测当前延迟是否高于第三数值。
其中,所述当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象的步骤包括:
当检测到当前网速低于第一数值和/或当前丢包率高于第二数值和/或检测当前延迟高于第三数值时;
控制调用预先生成的通话用户的虚拟形象。
其中,所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤包括:
将当前视频通话界面切换为虚拟形象进行显示;
并通过移动终端摄像头和/或声音传感器采集用户行为;
将用户行为映射到虚拟形象中并进行显示。
其中,所述对视频通话过程中的网络质量指标进行检测的步骤包括:
对视频通话过程中的网络质量指标进行实时检测或间隔固定时间进行检测。
其中,所述预设条件由用户根据使用习惯以及使用感受进行设置。
其中,所述控制获取预先生成的通话用户的虚拟形象的步骤之前包括:
对当前使用视频通话的通话用户人脸进行识别,检查所述通话用户是否有对应的预设虚拟形象,若有则直接调用所述预设虚拟形象,若无则分配模板虚拟形象用于视频通话。
其中,若所述通话用户没有对应的预设虚拟形象,则根据终端摄像头采集所述通话用户的外观信息并通过算法生成包含所述通话用户的外观特征的虚拟形象,并进行存储,使得所述通话用户后续再次使用视频通话时能够调用对应的预设虚拟形象。
其中,所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤之后包括:
检测所述虚拟形象通话过程中的网络质量指标,当所述网络质量指标在预设的时间内持续均回到正常值,则将虚拟形象显示切换为视频通话界面。
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本发明所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink) DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。
综上所述,一种虚拟形象视频通话方法、终端设备及存储介质,包括:对视频通话过程中的网络质量指标进行检测;判断当前视频通话的网络质量指标是否满足预定条件;当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。旨在解决当用户在使用移动终端进行视频通话的过程中由于网络质量变差,仅能通过降低画质的方法来维持视频通话,导致用户使用体验变差的问题。通过当检测到用户视频通话过程中网络质量变差,自动将视频通话界面切换为预先设置的虚拟形象进行显示,在网络较差的情况下也能维持高清晰度和流畅度的显示效果,提高用户使用体验。
预先设置多个用于放置在不同工位上的支持NFC功能且带NFC标签的NFC座机,预先为每个用户设置一NFC穿戴设备,并将每个用户的NFC穿戴设备与其NFC座机的号码建立对应关系;当有电话呼入,查找与呼入电话分机号对应的NFC穿戴设备;通过与呼入电话分机号对应的NFC穿戴设备,获取与该NFC穿戴设备距离最近工位的NFC座机;控制将所述电话呼入切换至与呼入电话分机号对应的NFC穿戴设备,距离最近工位的NFC座机振铃。旨在解决固定电话不能随时随地接到来电的问题,使企业用户在离开自己工位时也能通过其他工位上的座机接到打给自己的电话,做到使来电寻人的目的,提高办公效率。
应当理解的是,本发明公开的应用不限于上述的举例,对本领域普通技术人员来说,可以根据上述说明加以改进或变换,所有这些改进和变换都应属于本发明所附权利要求的保护范围。

Claims (20)

  1. 一种虚拟形象视频通话方法,其中,所述方法包括:
    对视频通话过程中的网络质量指标进行检测;
    判断当前视频通话的网络质量指标是否满足预定条件;
    当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;
    将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。
  2. 根据权利要求1所述的虚拟形象视频通话方法,其中,所述对视频通话过程中的网络质量指标进行检测的步骤之前包括:
    预先设置网络质量指标的预定条件,当检测到正在视频通话时的网络状态满足预定条件,则控制将视频通话界面切换为所述虚拟形象进行显示;
    预先设置当网络质量指标满足预定条件时使用的,用于视频通话的虚拟形象。
  3. 根据权利要求2所述的虚拟形象视频通话方法,其中,所述判断当前视频通话的网络质量指标是否满足预定条件的步骤包括:
    所述网络质量指标包括网速、丢包率以及延迟;
    检测当前网速是否低于第一数值和/或当前丢包率是否高于第二数值和/或检测当前延迟是否高于第三数值。
  4. 根据权利要求3所述的虚拟形象视频通话方法,其中,所述当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象的步骤包括:
    当检测到当前网速低于第一数值和/或当前丢包率高于第二数值和/或检测当前延迟高于第三数值时;
    控制调用预先生成的通话用户的虚拟形象。
  5. 根据权利要求1所述的虚拟形象视频通话方法,其中,所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤包括:
    将当前视频通话界面切换为虚拟形象进行显示;
    并通过移动终端摄像头和/或声音传感器采集用户行为;
    将用户行为映射到虚拟形象中并进行显示。
  6. 根据权利要求1所述的虚拟形象视频通话方法,其中,所述对视频通话过程中的网络质量指标进行检测的步骤包括:
    对视频通话过程中的网络质量指标进行实时检测或间隔固定时间进行检测。
  7. 根据权利要求1所述的虚拟形象视频通话方法,其中,所述预设条件由用户根据使用习惯以及使用感受进行设置。
  8. 根据权利要求1所述的虚拟形象视频通话方法,其中,所述控制获取预先生成的通话用户的虚拟形象的步骤之前包括:
    对当前使用视频通话的通话用户人脸进行识别,检查所述通话用户是否有对应的预设虚拟形象,若有则直接调用所述预设虚拟形象,若无则分配模板虚拟形象用于视频通话。
  9. 根据权利要求8所述的虚拟形象视频通话方法,其中,所述方法还包括:
    若所述通话用户没有对应的预设虚拟形象,则根据终端摄像头采集所述通话用户的外观信息并通过算法生成包含所述通话用户的外观特征的虚拟形象,并进行存储,使得所述通话用户后续再次使用视频通话时能够调用对应的预设虚拟形象。
  10. 根据权利要求1所述的虚拟形象视频通话方法,其中,所述将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示的步骤之后包括:
    检测所述虚拟形象通话过程中的网络质量指标,当所述网络质量指标在预设的时间内持续均回到正常值,则将虚拟形象显示切换为视频通话界面。
  11. 一种虚拟形象视频通话切换方法,其中,所述方法包括:
    当监测到接收或发出视频通话请求,检测是否接收到使用虚拟形象进行视频通话指令;
    当检测接收到使用虚拟形象进行视频通话指令时,调用或生成虚拟形象,并将用户的行为映射到虚拟形象中;
    检测到视频通话建立成功,将映射有用户行为的虚拟形象显示在视频通话画面中。
  12. 根据权利要求11所述的虚拟形象视频通话切换方法,其中,所述当监测到接收或发出视频通话请求的步骤之前包括:
    预先设置用于视频通话的虚拟形象,或预先设置根据用户外观特征自动生成虚拟形象的算法。
  13. 根据权利要求11所述的虚拟形象视频通话切换方法,其中,所述检测到视频通话建立成功,将映射有用户行为的虚拟形象显示在视频通话画面中的步骤包括:
    检测到视频通话建立成功后;
    控制声音传感器采集用户声音,并通过语音识别算法识别用户声音所对应的口型以及用户说话时的情感;
    控制将所述识别到的口型与情感映射为虚拟形象面部动作进行显示。
  14. 一种终端设备,其中,所述终端设备包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的虚拟形象视频通话程序,所述处理器执行所述虚拟形象视频通话程序时,执行以下步骤:
    对视频通话过程中的网络质量指标进行检测;
    判断当前视频通话的网络质量指标是否满足预定条件;
    当检测到视频通话的网络质量指标满足预定条件,则控制获取预先生成的通话用户的虚拟形象;
    将当前视频通话界面切换为映射有用户行为的所述虚拟形象进行显示。
  15. 根据权利要求14所述的终端设备,其中,所述处理器还用于执行:
    预先设置网络质量指标的预定条件,当检测到正在视频通话时的网络状态满足预定条件,则控制将视频通话界面切换为所述虚拟形象进行显示;
    预先设置当网络质量指标满足预定条件时使用的,用于视频通话的虚拟形象。
  16. 根据权利要求15所述的终端设备,其中,所述处理器还用于执行:
    所述网络质量指标包括网速、丢包率以及延迟;
    检测当前网速是否低于第一数值和/或当前丢包率是否高于第二数值和/或检测当前延迟是否高于第三数值。
  17. 根据权利要求16所述的终端设备,其中,所述处理器还用于执行:
    当检测到当前网速低于第一数值和/或当前丢包率高于第二数值和/或检测当前延迟高于第三数值时;
    控制调用预先生成的通话用户的虚拟形象。
  18. 根据权利要求14所述的终端设备,其中,所述处理器还用于执行:
    将当前视频通话界面切换为虚拟形象进行显示;
    并通过移动终端摄像头和/或声音传感器采集用户行为;
    将用户行为映射到虚拟形象中并进行显示。
  19. 根据权利要求14所述的终端设备,其中,所述处理器还用于执行:
    对视频通话过程中的网络质量指标进行实时检测或间隔固定时间进行检测。
  20. 一种计算机可读存储介质,其中,其上存储有虚拟形象视频通话程序,所述虚拟形象视频通话程序被处理器执行时,实现如权利要求1-10任一项所述的虚拟形象视频通话方法的步骤。
PCT/CN2022/104964 2021-08-09 2022-07-11 一种虚拟形象视频通话方法、终端设备及存储介质 WO2023016167A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/579,842 US20240179272A1 (en) 2021-08-09 2022-07-11 Virtual image video call method, terminal device, and storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110908844.6 2021-08-09
CN202110908844.6A CN113838178A (zh) 2021-08-09 2021-08-09 一种虚拟形象视频通话方法、终端设备及存储介质

Publications (1)

Publication Number Publication Date
WO2023016167A1 true WO2023016167A1 (zh) 2023-02-16

Family

ID=78963134

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/104964 WO2023016167A1 (zh) 2021-08-09 2022-07-11 一种虚拟形象视频通话方法、终端设备及存储介质

Country Status (3)

Country Link
US (1) US20240179272A1 (zh)
CN (1) CN113838178A (zh)
WO (1) WO2023016167A1 (zh)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113838178A (zh) * 2021-08-09 2021-12-24 惠州Tcl云创科技有限公司 一种虚拟形象视频通话方法、终端设备及存储介质
CN114500912B (zh) * 2022-02-23 2023-10-24 联想(北京)有限公司 通话处理方法、电子设备以及存储介质
CN116740316B (zh) * 2023-06-16 2024-04-23 联通沃音乐文化有限公司 一种基于xr技术构建高精度营业厅人流监测全景展示方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105516638A (zh) * 2015-12-07 2016-04-20 掌赢信息科技(上海)有限公司 一种视频通话方法、装置和系统
CN105554429A (zh) * 2015-11-19 2016-05-04 掌赢信息科技(上海)有限公司 一种视频通话显示方法及视频通话设备
CN109936774A (zh) * 2019-03-29 2019-06-25 广州虎牙信息科技有限公司 虚拟形象控制方法、装置及电子设备
CN112672388A (zh) * 2020-12-21 2021-04-16 深圳酷派技术有限公司 通话方法、装置、存储介质及用户终端
US20210350604A1 (en) * 2020-05-06 2021-11-11 Magic Leap, Inc. Audiovisual presence transitions in a collaborative reality environment
CN113838178A (zh) * 2021-08-09 2021-12-24 惠州Tcl云创科技有限公司 一种虚拟形象视频通话方法、终端设备及存储介质

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105554429A (zh) * 2015-11-19 2016-05-04 掌赢信息科技(上海)有限公司 一种视频通话显示方法及视频通话设备
CN105516638A (zh) * 2015-12-07 2016-04-20 掌赢信息科技(上海)有限公司 一种视频通话方法、装置和系统
CN109936774A (zh) * 2019-03-29 2019-06-25 广州虎牙信息科技有限公司 虚拟形象控制方法、装置及电子设备
US20210350604A1 (en) * 2020-05-06 2021-11-11 Magic Leap, Inc. Audiovisual presence transitions in a collaborative reality environment
CN112672388A (zh) * 2020-12-21 2021-04-16 深圳酷派技术有限公司 通话方法、装置、存储介质及用户终端
CN113838178A (zh) * 2021-08-09 2021-12-24 惠州Tcl云创科技有限公司 一种虚拟形象视频通话方法、终端设备及存储介质

Also Published As

Publication number Publication date
US20240179272A1 (en) 2024-05-30
CN113838178A (zh) 2021-12-24

Similar Documents

Publication Publication Date Title
WO2023016167A1 (zh) 一种虚拟形象视频通话方法、终端设备及存储介质
CN109691054A (zh) 动画用户标识符
US7882532B2 (en) System and method for multiplexing media information over a network with reduced communications resources using prior knowledge/experience of a called or calling party
US7508413B2 (en) Video conference data transmission device and data transmission method adapted for small display of mobile terminals
US8599236B2 (en) Utilizing a video image from a video communication session as contact information
US10229507B1 (en) Expression transfer across telecommunications networks
US9319468B2 (en) Information processing apparatus and information processing method
WO2018127091A1 (zh) 一种图像处理的方法、装置、相关设备及服务器
CN108289185B (zh) 一种视频通信方法、装置及终端设备
WO2018120127A1 (zh) 虚拟现实设备及其来电管理方法
CN109151309A (zh) 一种摄像头的转动控制方法、装置、设备和存储介质
CN111770298A (zh) 视频通话方法、装置、电子设备以及存储介质
CN112669846A (zh) 交互系统、方法、装置、电子设备及存储介质
WO2022193635A1 (zh) 客服服务系统、方法、装置、电子设备及存储介质
WO2011003315A1 (zh) 一种基于移动终端的图像处理方法及移动终端
CN110581974B (zh) 人脸画面改进方法、用户终端和计算机可读存储介质
CN114915852B (zh) 视频通话交互方法、装置、计算机设备和存储介质
CN113099038B (zh) 图像超分处理方法、图像超分处理装置及存储介质
CN112134999B (zh) 一种视频彩铃的处理方法、设备及计算机可读存储介质
CN112995565B (zh) 显示设备的摄像头调整方法、显示设备及存储介质
Jang et al. Mobile video communication based on augmented reality
CN113050791A (zh) 交互方法、装置、电子设备及存储介质
JP2006253775A5 (zh)
CN113706430A (zh) 一种图像处理方法、装置和用于图像处理的装置
CN117201890B (zh) 一种视频会议的数据传输控制方法及系统

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22855150

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 18579842

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: DE