CN113726956A - Incoming call answering control method and device, terminal equipment and storage medium - Google Patents

Incoming call answering control method and device, terminal equipment and storage medium Download PDF

Info

Publication number
CN113726956A
CN113726956A CN202110892910.5A CN202110892910A CN113726956A CN 113726956 A CN113726956 A CN 113726956A CN 202110892910 A CN202110892910 A CN 202110892910A CN 113726956 A CN113726956 A CN 113726956A
Authority
CN
China
Prior art keywords
voice
incoming call
reply
content
answering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110892910.5A
Other languages
Chinese (zh)
Inventor
张恩伟
周生明
张浩波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Mobile Software Co Ltd
Beijing Xiaomi Pinecone Electronic Co Ltd
Original Assignee
Beijing Xiaomi Mobile Software Co Ltd
Beijing Xiaomi Pinecone Electronic Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xiaomi Mobile Software Co Ltd, Beijing Xiaomi Pinecone Electronic Co Ltd filed Critical Beijing Xiaomi Mobile Software Co Ltd
Priority to CN202110892910.5A priority Critical patent/CN113726956A/en
Publication of CN113726956A publication Critical patent/CN113726956A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72484User interfaces specially adapted for cordless or mobile telephones wherein functions are triggered by incoming communication events
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)

Abstract

The embodiment of the application provides a method, a device, a terminal device and a storage medium for controlling incoming call answering, wherein the method is applied to a first device and comprises the following steps: after determining that the incoming call of the second device is received, outputting prompt information; receiving instruction information detected based on the prompt information; answering the incoming call when the instruction information indicates alternative answering; and after the incoming call is answered, generating and sending a reply voice to the second equipment according to the voice sent by the second equipment.

Description

Incoming call answering control method and device, terminal equipment and storage medium
Technical Field
The present application relates to the field of voice control technologies, and in particular, to a method and an apparatus for controlling incoming call answering, a terminal device, and a storage medium.
Background
In the related art, after receiving an incoming call, a terminal device generally needs a user to manually slide or touch the terminal device to answer or hang up the incoming call, and after the incoming call is answered, the user also needs to perform voice communication with an opposite terminal in time, so that the situation that a caller hangs up the incoming call due to the fact that the caller does not hear a speaking sound for a long time is avoided.
In some situations, for example, when the user is driving or in a meeting, the user cannot answer the call manually or is inconvenient to make a voice call, and some important calls or calls will be missed.
Disclosure of Invention
In view of this, the present disclosure provides an incoming call answering control method, an incoming call answering control device, a terminal device, and a storage medium.
According to a first aspect of the embodiments of the present disclosure, there is provided a method for controlling incoming call answering, applied to a first device, including:
after determining that the incoming call of the second device is received, outputting prompt information;
receiving instruction information detected based on the prompt information;
answering the incoming call when the instruction information indicates alternative answering;
and after the incoming call is answered, generating and sending a reply voice to the second equipment according to the voice sent by the second equipment.
In one embodiment, the generating and sending the reply voice to the second device according to the voice sent by the second device includes:
recognizing voice content of voice transmitted by the second equipment;
determining reply content according to the voice content;
and generating the reply voice according to the reply content.
In one embodiment, the determining reply content according to the voice content includes:
and inputting the voice content into a machine learning model to obtain reply content output by the machine learning model.
In one embodiment, the generating the reply voice according to the reply content includes:
and generating a reply voice of the reply content according to the reply content by using a preset sound configuration.
In one embodiment, the method comprises:
answering the incoming call when the instruction information indicates answering;
and after the incoming call is answered, outputting a call audio provided by the second equipment, and sending the collected voice to the second equipment.
In one embodiment, the method further comprises:
and hanging up the incoming call when the instruction information indicates rejection.
According to a second aspect of the embodiments of the present disclosure, there is provided an incoming call answering control device, applied to a first device, including:
the prompting module is used for outputting prompting information after determining that the incoming call of the second equipment is received;
the receiving module is used for receiving instruction information detected based on the prompt information;
the control module is used for answering the incoming call when the instruction information indicates alternative answering;
and the voice reply module is used for generating and sending reply voice to the second equipment according to the voice sent by the second equipment after the incoming call is answered.
In one embodiment, the voice reply module includes:
the recognition unit is used for recognizing the voice content of the voice sent by the second equipment;
the determining unit is used for determining reply content according to the voice content;
and the generating unit is used for generating the reply voice according to the reply content.
In one embodiment, the determining unit is configured to input the speech content into a machine learning model, and obtain a reply content output by the machine learning model.
In an embodiment, the generating unit is configured to generate a reply voice of the reply content according to the reply content in a preset sound configuration.
In one embodiment, the apparatus comprises:
the control module is used for answering the incoming call when the instruction information indicates answering;
and the voice transmission module is used for outputting the call audio provided by the second equipment and sending the collected voice to the second equipment after the incoming call is answered.
In one embodiment, the control module is further configured to hang up the incoming call when the instruction information indicates a rejection.
According to a third aspect of the embodiments of the present disclosure, there is provided a terminal device including the apparatus of the second aspect.
According to a fourth aspect of embodiments of the present disclosure, there is provided a computer-readable storage medium storing executable instructions for causing a processor to implement the method of the first aspect when executed.
According to the incoming call answering control method provided by the embodiment of the disclosure, when the first device indicates alternative answering based on the instruction information detected by the prompt information, the incoming call is answered; and after the incoming call is answered, generating and sending a reply voice to the second equipment according to the voice sent by the second equipment. According to the embodiment of the invention, the incoming call is controlled to be answered through the instruction information, the corresponding reply voice is automatically generated according to the voice of the opposite terminal, and when the incoming call is received, the user only needs to provide one instruction information to control the whole call process, so that the situation that an important incoming call or call is not missed under the scene without call conditions is realized.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of embodiments of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and, together with the description, serve to explain the principles of the embodiments of the disclosure.
Fig. 1 is a flowchart illustrating a first incoming call answering control method according to an exemplary embodiment;
fig. 2 is a flowchart illustrating a second incoming call answering control method according to an exemplary embodiment;
fig. 3 is a flowchart illustrating a third incoming call answering control method according to an exemplary embodiment;
fig. 4 is a block diagram illustrating a first incoming call answering control device according to an exemplary embodiment;
fig. 5 is a block diagram illustrating a second incoming call answering control device according to an exemplary embodiment;
fig. 6 is a block diagram illustrating a third incoming call answering control device according to an exemplary embodiment;
FIG. 7 is a schematic block diagram illustrating a first device according to an exemplary embodiment;
fig. 8 is a block diagram illustrating a structure of an incoming call answering control device according to an exemplary embodiment.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with embodiments of the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the disclosed embodiments, as detailed in the appended claims.
The terminology used in the embodiments of the present disclosure is for the purpose of describing particular embodiments only and is not intended to be limiting of the embodiments of the present disclosure. As used in the disclosed embodiments and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It is to be understood that although the terms first, second, third, etc. may be used herein to describe various information in the embodiments of the present disclosure, such information should not be limited by these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope of embodiments of the present disclosure. The word "if" as used herein may be interpreted as "at … …" or "when … …" or "in response to a determination", depending on the context.
As shown in fig. 1, the present exemplary embodiment provides an incoming call answering control method, applied to a first device, including:
step S101: after determining that the incoming call of the second device is received, outputting prompt information;
step S102: receiving instruction information detected based on the prompt information;
step S103: answering the incoming call when the instruction information indicates alternative answering;
step S104: and after the incoming call is answered, generating and sending a reply voice to the second equipment according to the voice sent by the second equipment.
In the present embodiment, the first device or the second device may be, for example, a mobile phone, a Personal Digital Assistant (PDA) mobile phone, a smart phone, a palm computer with communication software installed therein, a tablet computer, a notebook computer, or the like.
In this embodiment, after determining that the incoming call of the second device is received, the first device automatically outputs a prompt message to ask the user how to respond. Here, the prompt information may include a source of an incoming call (i.e., a user of the second device), an inquiry as to whether to answer the current incoming call, etc., wherein the source of the incoming call includes phone number information determined according to the incoming call, a contact name corresponding to the phone number information determined according to the incoming call, etc., which is inquired from the address book.
The prompt message comprises: voice prompts and text prompts or icon prompts of display interface prompts.
The voice prompt may be a play voice, for example, play "137? "," 0755 × ","? "," liu one call, whether answer is now answered? "," mom incoming call, answer or not? "," # company incoming call, answer now? Is there an incoming call, whether an AI is enabled for automatic answering? "and the like.
Preferably, a speaker may be used to play the voice prompt.
The display interface prompt may be to display a prompt message on a display interface of the first device.
For example, the display interface displays the text "137? "," 0755 × ","? "," liu one call, whether answer is now answered? "," mom incoming call, answer or not? "," # company incoming call, answer now? Is there an incoming call, whether an AI is enabled for automatic answering? "and the like.
For another example, an option button is further displayed on the display interface, and the option button may include: and the user can select whether to answer or not and what way to answer the call instead of answering, rejecting and rejecting the call and then generating preset information and other options.
And after outputting the prompt information, the first device further receives instruction information corresponding to the prompt information.
For example, the voice of the user is received through the microphone to obtain the voice instruction information of the user, and the instruction information may be: voice instruction information.
For another example, a click operation is detected that acts on a position on the display interface corresponding to the option button.
For another example, a sliding operation acting on the display interface is detected, and if the sliding operation is rightward sliding, answering is indicated; if the sliding operation is leftward sliding, indicating rejection; and if the sliding operation is upward sliding, alternative answering is indicated.
It is to be understood that the specific action indicated by the sliding operation is only an example, and the definition of the action indicated by the sliding operation may also be performed in other ways according to the practical application habit of the user, which is not limited by the embodiment of the present disclosure.
In this embodiment, the instruction information may indicate: and sending preset information after alternative answering, refusing and the like.
Here, when the instruction information indicates alternative answering, the first device is instructed to answer the incoming call and automatically performs voice reply. For example, the instruction information may be: the small love helps me to answer the phone, the automatic answering mode and the like.
And when the instruction information indicates answering, indicating the first equipment to answer the incoming call and sending the collected voice information to the second equipment. For example, the instruction information may be: "hear", "good", "next", etc.
And when the instruction information indicates rejection, indicating the first equipment to hang up the incoming call. For example, the instruction information may be: "do not connect", "do not connect first", "do not", etc.
And when the instruction information indicates that the call is rejected and then the preset information is sent, rejecting the call of the second equipment, namely the call of the first equipment is hung up, and further sending the preset information to the second equipment, wherein the preset information comprises preset voice information or preset text information. For example, the instruction information may be: "don't connect, return electricity later", "something is there, return electricity after 5 minutes", "drive, return electricity later" and the like. The preset information can be sent in a short message form or a bound instant messaging software tool.
In some possible embodiments, the prompt message may be played in different timbres according to a pre-configured mode of the user.
In step S104, when the instruction information indicates alternative answering, after the incoming call is answered, the first device generates a reply voice according to the voice sent by the second device, and sends the reply voice to the second device.
In some possible embodiments, the reply voice may be a preset voice or a voice corresponding to the preset reply content.
For example, the first device has at least one set of dialog tones pre-stored therein, each set of dialog tones including a question tone and a reply tone, each question tone corresponding to a question text, and each reply tone corresponding to a reply text. After the incoming call is answered, the first equipment receives the voice sent by the second equipment, analyzes the text content of the voice, performs similarity matching on the text content and the pre-stored problem text, determines the problem text with the highest similarity, takes the reply audio corresponding to the problem text as the reply voice, and sends the reply voice to the second equipment.
In other possible embodiments, the reply voice may be a voice generated according to a schedule in the first device.
For example, a schedule is stored in the first device, the schedule including at least one schedule record. After the incoming call is answered, the first device receives the voice sent by the second device, analyzes the text content of the voice, matches the time keywords in the text content with the time items in the schedule record, and generates a reply voice according to the schedule record and the text content which meet the preset matching requirement. Specifically, for example, the first device receives the voice "what do you want to do this evening" sent by the second device? "the schedule includes two records" 17:30 and Li eating steak "and" 20:00 and Li going to the center of art to watch the movie "warwolf 2" related to tonight, then the generated reply voice can be "I'm 18:30 and Li eating steak at tonight, then 20:00 and Li going to the center of art to watch the movie" warwolf 2 ".
According to the incoming call answering control method provided by the embodiment of the disclosure, when the first device indicates alternative answering based on the instruction information detected by the prompt information, the incoming call is answered; and after the incoming call is answered, generating and sending a reply voice to the second equipment according to the voice sent by the second equipment. According to the embodiment of the invention, the incoming call is controlled to be answered through the instruction information, the corresponding reply voice is automatically generated according to the voice of the opposite terminal, and when the incoming call is received, the user only needs to provide one instruction information to control the whole call process, so that the situation that an important incoming call or call is not missed under the scene without call conditions is realized.
In some possible embodiments, the received speech is converted to text and displayed after the call is received.
In some possible embodiments, the method further comprises: the text of the reply voice is displayed.
Therefore, the user can conveniently check the incoming call content of the second equipment and the reply content to the second equipment when the user answers the call in a substitute mode through the text display.
In other embodiments, of course, when the first device answers the call of the second device instead of the user, the first device may display the text related to the call when answering the call of the second device, or may not display the text related to the call, but generate the call log according to the text related to the call. The following are recorded in the call log: and the text corresponding to the voice sent by the second equipment and the text corresponding to the voice replied by the first equipment.
In some embodiments, when the incoming call of the second device is automatically answered based on the indication information indicating the alternative answering, the first device may further output the voice sent by the second device, and play the reply voice sent by the first device to the second device, so that the user can know the conversation content based on the voice output of the first device.
In one embodiment, the first device maintains voice silence during the alternative listening process, that is, does not output the voice sent by the second device, and does not broadcast the reply voice sent by the first device.
In some possible embodiments, after the incoming call is answered, the call process is recorded and stored, so that the user can know the call content through playback later.
In some possible embodiments, after the incoming call is answered, corresponding text information is generated and stored in sequence for the voice sent by the second device and the reply voice sent to the second device during the call, so that the user can know the call content through review later.
In some possible embodiments, as shown in fig. 2, the step S104 includes:
step S1041: recognizing voice content of voice transmitted by the second equipment;
step S1042: determining reply content according to the voice content;
step S1043: and generating the reply voice according to the reply content.
In step S1041, when recognizing the speech content of the speech sent by the second device, the speech may be recognized by using a preset model (e.g., hidden markov model) to obtain the speech content.
Or, the voice can be recognized by using a preset voice recognition library comprising the voice characteristic parameters of the user, so as to obtain the voice content. Wherein, the voice characteristic parameters comprise: acoustic feature parameters, timbre, pitch, duration, and intensity, etc. The recognition error caused by the fact that the voice input by the user is not standard is avoided, and the user experience is improved.
It is to be understood that any other speech recognition method may be used to recognize the speech content of the speech transmitted by the second device, and the present application is not limited thereto.
In step S1042, the first device prestores text contents and reply contents having a corresponding relationship, where each reply content corresponds to at least one text content.
After the voice content of the voice sent by the second equipment is identified, firstly, the text content with the highest similarity with the voice content is determined through similarity matching; then, the corresponding reply content is determined based on the text content. Here, the similarity may include: word frequency similarity, word segmentation vector similarity and semantic similarity.
In some possible embodiments, the step S1042 includes:
step S10421: and inputting the voice content into a machine learning model to obtain reply content output by the machine learning model.
In the present embodiment, the reply content is obtained by recognizing the speech content through the machine learning model. Here, the machine learning model includes: deep neural networks-hidden markov models (DNN-HMMs), end-to-end models based on CTC loss functions, end-to-end models based on attention mechanisms, and the like.
In some possible embodiments, the step S1043 includes:
step S10431: and generating a reply voice of the reply content according to the reply content by using a preset sound configuration.
In this embodiment, according to the reply content, the reply voice is generated by acoustic parameters such as preset loudness, tone, timbre, pitch, duration, and intensity, so as to meet the personalized requirements of the user.
For example, the preset timbre may be: the tone of the user, or a preset tone such as a star or cartoon character.
In some embodiments, the method may comprise:
and determining the tone adopted by the reply voice according to the incoming call identification (such as the mobile phone number or the fixed-line phone number) of the second equipment. For example, when the second device receives a call, the contact person can be determined according to the address book, and the currently alternative answering reply voice is determined according to the corresponding relation between the contact person and the tone.
For example, the contact indicates family, and the user's own tone may be used. The family members may be classified into a elder, a grander, a child, etc., and the user may have a tone deviation due to different utterances, etc. when speaking to the family members, and the tone deviation may be preset based on the correspondence.
In the stream of people, if the communication identification based on the incoming call finds that the incoming call is a strange call, a star or an animation role and the like can be adopted for carrying out alternative answering.
In some possible embodiments, as shown in fig. 3, the method further comprises:
step S103': answering the incoming call when the instruction information indicates answering;
step S104': and after the incoming call is answered, outputting a call audio provided by the second equipment, and sending the collected voice to the second equipment.
In this embodiment, when the instruction information indicates answering, that is, indicates normal answering and answering an incoming call, and after answering is successful, receives and outputs a call audio of a user at the second device end, and collects a voice of the user at the first device end and sends the voice to the second device.
In some possible embodiments, with continuing reference to fig. 3, the method further comprises:
step S103': and hanging up the incoming call when the instruction information indicates rejection.
As shown in fig. 4, the present exemplary embodiment further provides an incoming call answering control device 10, which is applied in a first device and may include:
the prompting module 110 is configured to output a prompting message after determining that the incoming call of the second device is received;
a receiving module 120, configured to receive instruction information detected based on the prompt information;
a control module 130, configured to answer the incoming call when the instruction information indicates alternative answering;
the voice reply module 140 is configured to generate and send a reply voice to the second device according to the voice sent by the second device after the incoming call is answered.
In some possible embodiments, as shown in fig. 5, the voice reply module 140 includes:
a recognition unit 1401 configured to recognize a voice content of the voice transmitted by the second device;
a determining unit 1402, configured to determine reply content according to the voice content;
a generating unit 1403, configured to generate the reply voice according to the reply content.
In some possible embodiments, the determining unit 1402 is configured to input the speech content into a machine learning model, and obtain a reply content output by the machine learning model.
In some possible embodiments, the generating unit 1403 is configured to generate a reply voice of the reply content according to the reply content in a preset sound configuration.
In some possible embodiments, as shown in fig. 6, the device 10, comprises:
the control module 130 is configured to answer the incoming call when the instruction information indicates answering;
and a voice delivery module 150, configured to output a call audio provided by the second device after the incoming call is answered, and send the collected voice to the second device.
In some possible embodiments, the control module 130 is further configured to hang up the incoming call when the instruction information indicates a rejection.
One specific example is provided below in connection with any of the embodiments described above:
as shown in fig. 7, the first device includes therein:
the telephone monitoring module is used for monitoring the incoming call of the second equipment;
an AI call module, that is, the voice reply module 140, configured to generate and send a reply voice to the second device according to the voice sent by the second device after the incoming call is answered;
TTS (Text to Speech), that is, the generating unit 1403 is configured to generate the reply Speech according to the reply content;
an offline speech recognition, i.e., a recognition unit 1401, configured to recognize speech content of speech transmitted by the second device;
the recording module is used for recording and storing the call process after the incoming call is answered;
the telephone module is used for carrying out common answering and comprises: answering the incoming call, receiving and outputting the call audio of the user at the second equipment end after the incoming call is successfully answered, acquiring the voice of the user at the first equipment end, and sending the voice to the second equipment;
the resource file, namely the pre-stored audio and/or text, may be stored in the cloud, is a cloud resource file, and may also be stored in the first device.
Based on the structure of the first device, the incoming call control is performed according to the following steps:
A. after the first device receives the incoming call of the second device, the telephone answering assistant (such as a favorite) of the first device broadcasts a prompt message customized by the user in advance, such as: voice prompt "incoming call";
B. if the user says 'answer the call' (i.e. answer), the first device can put through the call according to a common answering mode, answer the call, receive and play the voice sent by the second device and send the collected voice information to the second device;
if the user says "hang up" (i.e., refuses), the first device will hang up the phone;
if the user says "favorites me connect" (i.e., alternative answer), the first device will answer and respond to the user in the AI-call auto-answer mode by its phone answering assistant.
The present exemplary embodiment further provides a terminal device, where the terminal device includes the apparatus in any of the foregoing embodiments.
The present exemplary embodiment also provides a computer-readable storage medium, which stores executable instructions for causing a processor to implement the method according to any one of the above embodiments when executed.
The computer-readable storage medium may be: a storage medium such as a removable storage device, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and various media capable of storing program codes may be selected as a non-transitory storage medium.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
In an exemplary embodiment, each module/Unit in the Device may be implemented by one or more Central Processing Units (CPUs), Graphics Processing Units (GPUs), Baseband Processors (BPs), Application Specific Integrated Circuits (ASICs), DSPs, Programmable Logic Devices (PLDs), Complex Programmable Logic Devices (CPLDs), Field Programmable Gate Arrays (FPGAs), general purpose processors, controllers, Micro Controllers (MCUs), microprocessors (microprocessors), or other electronic components, for performing the aforementioned methods.
Fig. 8 is a block diagram illustrating a fingerprint processing device 800 according to an example embodiment. For example, the apparatus 800 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, an exercise device, a personal digital assistant, and the like.
Referring to fig. 8, the apparatus 800 may include one or more of the following components: processing component 802, memory 804, power component 806, multimedia component 808, audio component 810, input/output (I/O) interface 812, sensor component 814, and communication component 816.
The processing component 802 generally controls overall operation of the device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing components 802 may include one or more processors 820 to execute instructions to perform all or a portion of the steps of the methods described above. Further, the processing component 802 can include one or more modules that facilitate interaction between the processing component 802 and other components. For example, the processing component 802 can include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802.
The memory 804 is configured to store various types of data to support operations at the apparatus 800. Examples of such data include instructions for any application or method operating on device 800, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 804 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
Power components 806 provide power to the various components of device 800. The power components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the apparatus 800.
The multimedia component 808 includes a screen that provides an output interface between the device 800 and a user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 808 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera may receive external multimedia data when the device 800 is in an operating mode, such as a shooting mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.
The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a Microphone (MIC) configured to receive external audio signals when the apparatus 800 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 804 or transmitted via the communication component 816. In some embodiments, audio component 810 also includes a speaker for outputting audio signals.
The I/O interface 812 provides an interface between the processing component 802 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
The sensor assembly 814 includes one or more sensors for providing various aspects of state assessment for the device 800. For example, the sensor assembly 814 may detect the open/closed status of the device 800, the relative positioning of components, such as a display and keypad of the device 800, the sensor assembly 814 may also detect a change in the position of the device 800 or a component of the device 800, the presence or absence of user contact with the device 800, the orientation or acceleration/deceleration of the device 800, and a change in the temperature of the device 800. Sensor assembly 814 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 816 is configured to facilitate communications between the apparatus 800 and other devices in a wired or wireless manner. The apparatus 800 may access a wireless network based on a communication standard, such as WiFi, 4G or 5G, or a combination thereof. In an exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the apparatus 800 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.
In an exemplary embodiment, a non-transitory computer-readable storage medium comprising instructions, such as the memory 804 comprising instructions, executable by the processor 820 of the device 800 to perform the above-described method is also provided. For example, the non-transitory computer readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
Other embodiments of the disclosed embodiments will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the embodiments of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosed embodiments being indicated by the following claims.
It is to be understood that the disclosed embodiments are not limited to the precise arrangements described above and shown in the drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the embodiments of the present disclosure is limited only by the appended claims.

Claims (14)

1. A method for controlling call answering is applied to a first device and comprises the following steps:
after determining that the incoming call of the second device is received, outputting prompt information;
receiving instruction information detected based on the prompt information;
answering the incoming call when the instruction information indicates alternative answering;
and after the incoming call is answered, generating and sending a reply voice to the second equipment according to the voice sent by the second equipment.
2. The method of claim 1, wherein generating and sending a reply voice to the second device based on the voice sent by the second device comprises:
recognizing voice content of voice transmitted by the second equipment;
determining reply content according to the voice content;
and generating the reply voice according to the reply content.
3. The method of claim 2, wherein determining reply content based on the voice content comprises:
and inputting the voice content into a machine learning model to obtain reply content output by the machine learning model.
4. The method according to claim 2, wherein the generating the reply voice according to the reply content comprises:
and generating a reply voice of the reply content according to the reply content by using a preset sound configuration.
5. The method of claim 1, comprising:
answering the incoming call when the instruction information indicates answering;
and after the incoming call is answered, outputting a call audio provided by the second equipment, and sending the collected voice to the second equipment.
6. The method of claim 1, further comprising:
and hanging up the incoming call when the instruction information indicates rejection.
7. A incoming call answering control device is applied to first equipment and comprises:
the prompting module is used for outputting prompting information after determining that the incoming call of the second equipment is received;
the receiving module is used for receiving instruction information detected based on the prompt information;
the control module is used for answering the incoming call when the instruction information indicates alternative answering;
and the voice reply module is used for generating and sending reply voice to the second equipment according to the voice sent by the second equipment after the incoming call is answered.
8. The apparatus of claim 7, wherein the voice reply module comprises:
the recognition unit is used for recognizing the voice content of the voice sent by the second equipment;
the determining unit is used for determining reply content according to the voice content;
and the generating unit is used for generating the reply voice according to the reply content.
9. The apparatus of claim 8, wherein the determining unit is configured to input the speech content into a machine learning model, and obtain a reply content output by the machine learning model.
10. The apparatus according to claim 8, wherein the generating unit is configured to generate a reply voice of the reply content in a preset sound configuration according to the reply content.
11. The apparatus of claim 7, wherein the apparatus comprises:
the control module is used for answering the incoming call when the instruction information indicates answering;
and the voice transmission module is used for outputting the call audio provided by the second equipment and sending the collected voice to the second equipment after the incoming call is answered.
12. The apparatus of claim 7, wherein the control module is further configured to hang up the incoming call when the instruction information indicates a rejection.
13. A terminal device, characterized in that it comprises the apparatus of any of claims 7 to 12.
14. A computer-readable storage medium having stored thereon executable instructions for causing a processor, when executed, to implement the method of any one of claims 1 to 6.
CN202110892910.5A 2021-08-04 2021-08-04 Incoming call answering control method and device, terminal equipment and storage medium Pending CN113726956A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110892910.5A CN113726956A (en) 2021-08-04 2021-08-04 Incoming call answering control method and device, terminal equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110892910.5A CN113726956A (en) 2021-08-04 2021-08-04 Incoming call answering control method and device, terminal equipment and storage medium

Publications (1)

Publication Number Publication Date
CN113726956A true CN113726956A (en) 2021-11-30

Family

ID=78674856

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110892910.5A Pending CN113726956A (en) 2021-08-04 2021-08-04 Incoming call answering control method and device, terminal equipment and storage medium

Country Status (1)

Country Link
CN (1) CN113726956A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024114233A1 (en) * 2022-11-30 2024-06-06 华为技术有限公司 Call method and electronic device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110401777A (en) * 2019-08-02 2019-11-01 上海尊源通讯技术有限公司 A kind of AI phone secretary system based on communication terminal
CN111683175A (en) * 2020-04-22 2020-09-18 北京捷通华声科技股份有限公司 Method, device, equipment and storage medium for automatically answering incoming call

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110401777A (en) * 2019-08-02 2019-11-01 上海尊源通讯技术有限公司 A kind of AI phone secretary system based on communication terminal
CN111683175A (en) * 2020-04-22 2020-09-18 北京捷通华声科技股份有限公司 Method, device, equipment and storage medium for automatically answering incoming call

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024114233A1 (en) * 2022-11-30 2024-06-06 华为技术有限公司 Call method and electronic device

Similar Documents

Publication Publication Date Title
KR102233343B1 (en) Providing a personal assistant module with a selectively-traversable state machine
US9031842B2 (en) Methods and devices for facilitating communications
EP2210214B1 (en) Automatic identifying
US9111538B2 (en) Genius button secondary commands
CN107767864B (en) Method and device for sharing information based on voice and mobile terminal
CN111063354B (en) Man-machine interaction method and device
CN103026740A (en) Method and apparatus for suggesting a message segment
CN105704766B (en) The control method and device of double-card mobile terminal
CN111556197B (en) Method and device for realizing voice assistant and computer storage medium
CN105915521B (en) Multi-party call management method, device and terminal
CN104219644A (en) Emergency communication method and device
CN110619873A (en) Audio processing method, device and storage medium
CN107945806B (en) User identification method and device based on sound characteristics
CN105260247A (en) Method and device for controlling terminal application
CN105677023B (en) Information demonstrating method and device
CN106534459A (en) Voice prompt method and device
CN106603381B (en) Method and device for processing chat information
CN106657543B (en) Voice information processing method and device
CN106384586A (en) Method and device for reading text information
CN105100352B (en) Obtain the method and device of associated person information
CN113726956A (en) Incoming call answering control method and device, terminal equipment and storage medium
CN111816174A (en) Speech recognition method, device and computer readable storage medium
CN112863511B (en) Signal processing method, device and storage medium
CN109451135A (en) Contact person's creation method and device
CN108234782B (en) Incoming call processing method, device and equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination