CN113938556B

CN113938556B - Incoming call prompting method and device and electronic equipment

Info

Publication number: CN113938556B
Application number: CN202010675888.4A
Authority: CN
Inventors: 李向东
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2020-07-14
Filing date: 2020-07-14
Publication date: 2023-03-10
Anticipated expiration: 2040-07-14
Also published as: CN113938556A

Abstract

The embodiment of the application provides an incoming call prompting method, an incoming call prompting device and electronic equipment, wherein in the method, voice signals of parties except a user of the electronic equipment in a voice call process are obtained, parameter values of preset voice parameters are extracted from the voice signals, and voice synthesis is carried out according to the extracted parameter values of the preset voice parameters to obtain first voice signals; the method comprises the steps of setting incoming call prompt tones of incoming call numbers of a calling party according to a first voice signal, receiving the incoming calls of the incoming call numbers, determining that the incoming call prompt mode comprises voice prompt, and playing the incoming call prompt tones of the incoming call numbers, so that different personalized prompt tones can be used for different incoming call numbers, and a user can distinguish callers relatively easily according to the personalized prompt tones.

Description

Incoming call prompting method and device and electronic equipment

Technical Field

The present application relates to the field of sound processing technologies, and in particular, to a method and an apparatus for prompting an incoming call, and an electronic device.

Background

In the prior art, when an electronic device such as a mobile phone receives an incoming call by making a call from another person, a voice prompt method for the incoming call is mainly preset music or voice broadcast of a telephone number and/or a name of the caller. Hereinafter, a person who makes a call is referred to as a caller, and a sound which is played to a user when the user receives the call and prompts the user of the call is referred to as a prompt sound.

If the incoming call prompt tone is music, the user generally sets one music for the common contact in the address list, and then sets different music for a specific single contact or contact group in the address list. However, each kind of music for making an incoming call alert generally corresponds to many contacts except music set for a particular single contact, and therefore, the user cannot know the caller through the music for making an incoming call alert in most cases.

If the prompt tone of the incoming call is the voice broadcast of the telephone number and/or the name of the caller, the voice broadcast is too single, so that the prompt tone is not personalized.

Disclosure of Invention

The embodiment of the application provides a method and a device for prompting an incoming call and electronic equipment, which can use different personalized prompt tones for different incoming call numbers and enable a user to distinguish an caller relatively easily according to the personalized prompt tones.

In a first aspect, an embodiment of the present application provides an incoming call prompting method, which is applied to an electronic device with a call function, and the method includes:

acquiring voice signals of other parties except a user of the electronic equipment in the voice call process;

extracting parameter values of preset voice parameters from the voice signals; performing voice synthesis according to the extracted parameter value of the preset voice parameter to obtain a first voice signal; setting an incoming call prompt tone of the incoming call number of the calling party according to the first voice signal;

and receiving the incoming call of the incoming call number, determining that the incoming call prompt mode comprises voice prompt, and playing the incoming call prompt tone of the incoming call number.

In the embodiment, the prompt tone with the voice characteristics of the calling party (namely, the caller to which the calling number belongs) is set for the calling number, so that the individual requirements of the calling prompt tone are met, and moreover, the calling prompt tone of each calling number has the voice characteristics of the caller, so that a user can relatively easily distinguish the caller according to the prompt tone.

In one possible implementation manner, the method further includes:

judging whether an incoming call of the incoming call number of a calling party is received for the first time;

if yes, executing the step of voice synthesis according to the extracted parameter value of the preset voice parameter;

if not, carrying out voice synthesis according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number to obtain a second voice signal; extracting a parameter value of a preset voice parameter corresponding to the incoming call number from a voice signal of a calling party in at least one voice call process before the voice call process;

and setting the incoming call prompt tone of the incoming call number according to the second voice signal.

In this embodiment, the parameter value of the preset voice parameter corresponding to the incoming call number is extracted from the voice signal of the calling party in at least one voice call process before the voice call process, so that the voice feature of the caller can be better represented according to the parameter value of the preset voice parameter extracted from the voice signal and the second voice signal obtained by performing voice synthesis on the parameter value of the preset voice parameter corresponding to the incoming call number, and thus the caller can be more easily distinguished by the user according to the incoming call prompt tone set by the second voice signal.

In a possible implementation manner, before performing speech synthesis according to a parameter value of a preset speech parameter extracted from a speech signal and a parameter value of a preset speech parameter corresponding to an incoming call number, the method further includes:

and judging whether the incoming call prompt tone updating condition of the incoming call number is met according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number.

In one possible implementation manner, performing speech synthesis according to a parameter value of a preset speech parameter extracted from a speech signal and a parameter value of a preset speech parameter corresponding to an incoming call number includes:

updating the parameter value of the preset voice parameter corresponding to the incoming call number by using the parameter value of the preset voice parameter extracted from the voice signal;

and carrying out voice synthesis according to the parameter value with the maximum weight of each preset voice parameter corresponding to the incoming call number.

In this embodiment, voice synthesis is performed according to the parameter value with the largest weight of each preset voice parameter corresponding to the caller number, so that the second voice signal obtained by voice synthesis can better represent the voice feature of the caller, and thus the caller can be more easily distinguished by the user according to the call prompt tone set by the second voice signal.

In one possible implementation manner, the setting of the incoming call prompt tone of the incoming call number according to the second voice signal includes:

setting the second voice signal as an incoming call prompt tone of the incoming call number; alternatively, the first and second electrodes may be,

displaying an editing interface of the second voice signal for a user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a third voice signal; and when receiving the setting confirmation operation of the user in the editing interface, setting the third voice signal as the incoming call prompt tone of the incoming call number.

In one possible implementation manner, the setting of the incoming call prompt tone of the incoming call number according to the first voice signal includes:

setting the first voice signal as an incoming call prompt tone of an incoming call number; alternatively, the first and second electrodes may be,

displaying an editing interface of the first voice signal for a user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a fourth voice signal; and when receiving the setting confirmation operation of the user in the editing interface, setting the fourth voice signal as the incoming call prompt tone of the incoming call number.

In one possible implementation manner, the extracting a parameter value of a preset speech parameter from a speech signal includes:

sampling a voice signal according to a preset sampling rule to obtain a sampling signal;

and extracting the parameter value of the preset voice parameter according to the sampling signal.

In one possible implementation, sampling the speech signal according to a preset sampling rule includes:

starting to sample the voice signal from the call starting time of the voice call until the time length of the sampled voice signal reaches the preset target total time length or the voice call is ended; alternatively, the first and second electrodes may be,

sampling voice signals of which the signal intensity exceeds a preset intensity threshold value in the voice signals until the total duration of the sampled signals reaches a preset target total duration or the voice call is ended; alternatively, the first and second electrodes may be,

and sampling voice signals with preset words in the voice signals until the total duration of the sampling signals reaches a preset target total duration or the voice call is ended.

In a second aspect, an embodiment of the present application provides an electronic device, including:

one or more processors; a memory; and one or more computer programs, wherein the one or more computer programs are stored in the memory, the one or more computer programs comprising instructions which, when executed by the apparatus, cause the apparatus to perform the method of any of the first aspects.

In a third aspect, embodiments of the present application provide a computer-readable storage medium, in which a computer program is stored, which, when run on a computer, causes the computer to perform the method of any one of the first aspect.

In a fourth aspect, embodiments of the present application provide a computer program product, which when run on an electronic device, causes a processor to perform the method of any one of the first aspect.

Drawings

FIG. 1 is a block diagram of one embodiment of an electronic device of the present application;

FIG. 2 is a diagram of a software system architecture of an electronic device according to the present application;

FIGS. 3A to 3F are exemplary Graphical User Interfaces (GUI) for an incoming call prompt method according to the present application;

FIG. 4 is a flowchart of an embodiment of an incoming call alert method of the present application;

FIG. 5 is a flowchart of another embodiment of an incoming call prompt method of the present application;

fig. 6 is a structural diagram of an embodiment of the incoming call prompt device of the present application.

Detailed Description

The terminology used in the description of the embodiments section of the present application is for the purpose of describing particular embodiments of the present application only and is not intended to be limiting of the present application.

The technical solutions in the embodiments of the present application will be described below with reference to the drawings in the embodiments of the present application. Wherein in the description of the embodiments of the present application, "/" indicates an OR meaning, for example, A/B may indicate A or B; "and/or" herein is merely an association describing an associated object, and means that there may be three relationships, e.g., a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. In addition, in the description of the embodiments of the present application, "a plurality" means two or more than two.

As described in the background section, in the prior art, when an electronic device, such as a mobile phone, receives an incoming call from another person (i.e., a caller), the incoming call is mainly a preset music, or a voice broadcast of a phone number and/or a name of the caller.

The embodiment of the application provides a call prompt method, which sets prompt tones with the voice characteristics of the caller to which the call number belongs for the call number which has made a call, so as to meet the individual requirements of the call prompt tones, and moreover, as the call prompt tone of each call number has the voice characteristics of the caller, the user can distinguish the caller relatively easily according to the prompt tones.

The incoming call prompting method can be applied to electronic equipment with a call function, such as a mobile phone, wearable equipment, a vehicle-mounted telephone and the like.

By way of example, fig. 1 shows a schematic structural diagram of an electronic device 100. The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a Universal Serial Bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, a sensor module 180, a key 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a Subscriber Identity Module (SIM) card interface 195, and the like. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.

It is to be understood that the illustrated structure of the embodiment of the present invention does not specifically limit the electronic device 100. In other embodiments of the present application, electronic device 100 may include more or fewer components than shown, or some components may be combined, some components may be split, or a different arrangement of components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

Processor 110 may include one or more processing units, such as: the processor 110 may include an Application Processor (AP), a modem processor, a Graphics Processing Unit (GPU), an Image Signal Processor (ISP), a controller, a video codec, a Digital Signal Processor (DSP), a baseband processor, and/or a neural-Network Processing Unit (NPU), etc. The different processing units may be separate devices or may be integrated into one or more processors.

The controller can generate an operation control signal according to the instruction operation code and the timing signal to complete the control of instruction fetching and instruction execution.

A memory may also be provided in the processor 110 for storing instructions and data. In some embodiments, the memory in the processor 110 is a cache memory. The memory may hold instructions or data that have just been used or recycled by the processor 110. If the processor 110 needs to reuse the instruction or data, it can be called directly from the memory. Avoiding repeated accesses reduces the latency of the processor 110, thereby increasing the efficiency of the system.

In some embodiments, processor 110 may include one or more interfaces. The interface may include an integrated circuit (I2C) interface, an integrated circuit built-in audio (I2S) interface, a Pulse Code Modulation (PCM) interface, a universal asynchronous receiver/transmitter (UART) interface, a Mobile Industry Processor Interface (MIPI), a general-purpose input/output (GPIO) interface, a Subscriber Identity Module (SIM) interface, and/or a Universal Serial Bus (USB) interface, etc.

The I2C interface is a bidirectional synchronous serial bus including a serial data line (SDA) and a Serial Clock Line (SCL). In some embodiments, processor 110 may include multiple sets of I2C buses. The processor 110 may be coupled to the touch sensor 180K, a charger, a flash, a camera 193, etc. through different I2C bus interfaces, respectively. For example: the processor 110 may be coupled to the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate through an I2C bus interface to implement a touch function of the electronic device 100.

The I2S interface may be used for audio communication. In some embodiments, processor 110 may include multiple sets of I2S buses. The processor 110 may be coupled to the audio module 170 through an I2S bus, enabling communication between the processor 110 and the audio module 170. In some embodiments, the audio module 170 may transmit an audio signal to the wireless communication module 160 through the I2S interface, so as to implement a function of answering a call through a bluetooth headset.

The PCM interface may also be used for audio communication, sampling, quantizing and encoding analog signals. In some embodiments, audio module 170 and wireless communication module 160 may be coupled by a PCM bus interface. In some embodiments, the audio module 170 may also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to implement a function of answering a call through a bluetooth headset. Both the I2S interface and the PCM interface may be used for audio communication.

The UART interface is a universal serial data bus used for asynchronous communications. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is generally used to connect the processor 110 and the wireless communication module 160. For example: the processor 110 communicates with a bluetooth module in the wireless communication module 160 through a UART interface to implement a bluetooth function. In some embodiments, the audio module 170 may transmit the audio signal to the wireless communication module 160 through a UART interface, so as to realize the function of playing music through a bluetooth headset.

The MIPI interface may be used to connect the processor 110 with peripheral devices such as the display screen 194, the camera 193, and the like. The MIPI interface includes a Camera Serial Interface (CSI), a Display Serial Interface (DSI), and the like. In some embodiments, processor 110 and camera 193 communicate through a CSI interface to implement the capture functionality of electronic device 100. The processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100.

The GPIO interface may be configured by software. The GPIO interface may be configured as a control signal and may also be configured as a data signal. In some embodiments, a GPIO interface may be used to connect the processor 110 with the camera 193, the display 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface may also be configured as an I2C interface, I2S interface, UART interface, MIPI interface, and the like.

The USB interface 130 is an interface conforming to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, or the like. The USB interface 130 may be used to connect a charger to charge the electronic device 100, and may also be used to transmit data between the electronic device 100 and a peripheral device. And the earphone can also be used for connecting an earphone and playing audio through the earphone. The interface may also be used to connect other electronic devices, such as AR devices and the like.

It should be understood that the connection relationship between the modules according to the embodiment of the present invention is only illustrative, and is not limited to the structure of the electronic device 100. In other embodiments of the present application, the electronic device 100 may also adopt different interface connection manners or a combination of multiple interface connection manners in the above embodiments.

The charging management module 140 is configured to receive charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from a wired charger via the USB interface 130. In some wireless charging embodiments, the charging management module 140 may receive a wireless charging input through a wireless charging coil of the electronic device 100. The charging management module 140 may also supply power to the electronic device through the power management module 141 while charging the battery 142.

The power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110. The power management module 141 receives input from the battery 142 and/or the charge management module 140, and supplies power to the processor 110, the internal memory 121, the display 194, the camera 193, the wireless communication module 160, and the like. The power management module 141 may also be used to monitor parameters such as battery capacity, battery cycle count, battery state of health (leakage, impedance), etc. In other embodiments, the power management module 141 may be disposed in the processor 110. In other embodiments, the power management module 141 and the charging management module 140 may be disposed in the same device.

The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, a modem processor, a baseband processor, and the like.

The antennas 1 and 2 are used for transmitting and receiving electromagnetic wave signals. Each antenna in the electronic device 100 may be used to cover a single or multiple communication bands. Different antennas can also be multiplexed to improve the utilization of the antennas. For example: the antenna 1 may be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.

The mobile communication module 150 may provide a solution including wireless communication of 2G/3G/4G/5G, etc. applied to the electronic device 100. The mobile communication module 150 may include at least one filter, a switch, a power amplifier, a Low Noise Amplifier (LNA), and the like. The mobile communication module 150 may receive the electromagnetic wave from the antenna 1, filter, amplify, etc. the received electromagnetic wave, and transmit the electromagnetic wave to the modem processor for demodulation. The mobile communication module 150 may also amplify the signal modulated by the modem processor, and convert the signal into electromagnetic wave through the antenna 1 to radiate the electromagnetic wave. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the processor 110. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be provided in the same device as at least some of the modules of the processor 110.

The modem processor may include a modulator and a demodulator. The modulator is used for modulating a low-frequency baseband signal to be transmitted into a medium-high frequency signal. The demodulator is used for demodulating the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then passes the demodulated low frequency baseband signal to a baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and then transferred to the application processor. The application processor outputs a sound signal through an audio device (not limited to the speaker 170A, the receiver 170B, etc.) or displays an image or video through the display screen 194. In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be provided in the same device as the mobile communication module 150 or other functional modules, independent of the processor 110.

The wireless communication module 160 may provide solutions for wireless communication applied to the electronic device 100, including Wireless Local Area Networks (WLANs) (e.g., wireless fidelity (Wi-Fi) networks), bluetooth (bluetooth, BT), global Navigation Satellite System (GNSS), frequency Modulation (FM), near Field Communication (NFC), infrared (IR), and the like. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2, performs frequency modulation and filtering on electromagnetic wave signals, and transmits the processed signals to the processor 110. The wireless communication module 160 may also receive a signal to be transmitted from the processor 110, perform frequency modulation and amplification on the signal, and convert the signal into electromagnetic waves via the antenna 2 to radiate the electromagnetic waves.

In some embodiments, antenna 1 of electronic device 100 is coupled to mobile communication module 150 and antenna 2 is coupled to wireless communication module 160 so that electronic device 100 can communicate with networks and other devices through wireless communication techniques. The wireless communication technology may include global system for mobile communications (GSM), general Packet Radio Service (GPRS), code division multiple access (code division multiple access, CDMA), wideband Code Division Multiple Access (WCDMA), time-division code division multiple access (time-division code division multiple access, TD-SCDMA), long Term Evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technologies, etc. The GNSS may include a Global Positioning System (GPS), a global navigation satellite system (GLONASS), a beidou navigation satellite system (BDS), a quasi-zenith satellite system (QZSS), and/or a Satellite Based Augmentation System (SBAS).

The electronic device 100 implements display functions via the GPU, the display screen 194, and the application processor. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and an application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.

The display screen 194 is used to display images, video, and the like. The display screen 194 includes a display panel. The display panel may adopt a Liquid Crystal Display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (active-matrix organic light-emitting diode, AMOLED), a flexible light-emitting diode (FLED), a miniature, a Micro-oeld, a quantum dot light-emitting diode (QLED), and the like. In some embodiments, the electronic device 100 may include 1 or N display screens 194, N being a positive integer greater than 1.

The electronic device 100 may implement a shooting function through the ISP, the camera 193, the video codec, the GPU, the display 194, the application processor, and the like.

The ISP is used to process the data fed back by the camera 193. For example, when a photo is taken, the shutter is opened, light is transmitted to the camera photosensitive element through the lens, the optical signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing and converting into an image visible to naked eyes. The ISP can also carry out algorithm optimization on the noise, brightness and skin color of the image. The ISP can also optimize parameters such as exposure, color temperature and the like of a shooting scene. In some embodiments, the ISP may be provided in camera 193.

The camera 193 is used to capture still images or video. The object generates an optical image through the lens and projects the optical image to the photosensitive element. The photosensitive element may be a Charge Coupled Device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to be converted into a digital image signal. And the ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into image signal in standard RGB, YUV and other formats. In some embodiments, electronic device 100 may include 1 or N cameras 193, N being a positive integer greater than 1.

The digital signal processor is used for processing digital signals, and can process digital image signals and other digital signals. For example, when the electronic device 100 selects a frequency bin, the digital signal processor is used to perform fourier transform or the like on the frequency bin energy.

Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 may play or record video in a variety of encoding formats, such as: moving Picture Experts Group (MPEG) 1, MPEG2, MPEG3, MPEG4, and the like.

The NPU is a neural-network (NN) computing processor that processes input information quickly by using a biological neural network structure, for example, by using a transfer mode between neurons of a human brain, and can also learn by itself continuously. Applications such as intelligent recognition of the electronic device 100 can be realized through the NPU, for example: image recognition, face recognition, speech recognition, text understanding, and the like.

The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to extend the memory capability of the electronic device 100. The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function. For example, files such as music, video, etc. are saved in an external memory card.

The internal memory 121 may be used to store computer-executable program code, which includes instructions. The internal memory 121 may include a program storage area and a data storage area. The storage program area may store an operating system, an application program (such as a sound playing function, an image playing function, etc.) required by at least one function, and the like. The storage data area may store data (such as audio data, phone book, etc.) created during use of the electronic device 100, and the like. In addition, the internal memory 121 may include a high-speed random access memory, and may further include a nonvolatile memory, such as at least one magnetic disk storage device, a flash memory device, a universal flash memory (UFS), and the like. The processor 110 executes various functional applications of the electronic device 100 and data processing by executing instructions stored in the internal memory 121 and/or instructions stored in a memory provided in the processor.

The electronic device 100 may implement audio functions via the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone interface 170D, and the application processor. Such as music playing, recording, etc.

The audio module 170 is used to convert digital audio information into an analog audio signal output and also to convert an analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be disposed in the processor 110, or some functional modules of the audio module 170 may be disposed in the processor 110.

The speaker 170A, also called a "horn", is used to convert the audio electrical signal into an acoustic signal. The electronic apparatus 100 can listen to music through the speaker 170A or listen to a handsfree call.

The receiver 170B, also called "earpiece", is used to convert the electrical audio signal into an acoustic signal. When the electronic apparatus 100 receives a call or voice information, it can receive voice by placing the receiver 170B close to the ear of the person.

The microphone 170C, also referred to as a "microphone," is used to convert sound signals into electrical signals. When making a call or transmitting voice information, the user can input a voice signal to the microphone 170C by speaking near the microphone 170C through the mouth. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C to achieve a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further include three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, perform directional recording, and so on.

The earphone interface 170D is used to connect a wired earphone. The headset interface 170D may be the USB interface 130, or may be a 3.5mm open mobile electronic device platform (OMTP) standard interface, a cellular telecommunications industry association (cellular telecommunications industry association of the USA, CTIA) standard interface.

The pressure sensor 180A is used for sensing a pressure signal, and can convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 180A may be disposed on the display screen 194. The pressure sensor 180A can be of a wide variety, such as a resistive pressure sensor, an inductive pressure sensor, a capacitive pressure sensor, and the like. The capacitive pressure sensor may be a sensor comprising at least two parallel plates having an electrically conductive material. When a force acts on the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device 100 determines the strength of the pressure from the change in capacitance. When a touch operation is applied to the display screen 194, the electronic apparatus 100 detects the intensity of the touch operation according to the pressure sensor 180A. The electronic apparatus 100 may also calculate the touched position from the detection signal of the pressure sensor 180A. In some embodiments, the touch operations that are applied to the same touch position but have different touch operation intensities may correspond to different operation instructions. For example: and when the touch operation with the touch operation intensity smaller than the first pressure threshold value acts on the short message application icon, executing an instruction for viewing the short message. And when the touch operation with the touch operation intensity larger than or equal to the first pressure threshold value acts on the short message application icon, executing an instruction of newly building the short message.

The gyro sensor 180B may be used to determine the motion attitude of the electronic device 100. In some embodiments, the angular velocity of electronic device 100 about three axes (i.e., the x, y, and z axes) may be determined by gyroscope sensor 180B. The gyro sensor 180B may be used for photographing anti-shake. For example, when the shutter is pressed, the gyro sensor 180B detects a shake angle of the electronic device 100, calculates a distance to be compensated for by the lens module according to the shake angle, and allows the lens to counteract the shake of the electronic device 100 through a reverse movement, thereby achieving anti-shake. The gyroscope sensor 180B may also be used for navigation, somatosensory gaming scenes.

The air pressure sensor 180C is used to measure air pressure. In some embodiments, electronic device 100 calculates altitude from barometric pressure values measured by barometric pressure sensor 180C to assist in positioning and navigation.

The magnetic sensor 180D includes a hall sensor. The electronic device 100 may detect the opening and closing of the flip holster using the magnetic sensor 180D. In some embodiments, when the electronic device 100 is a flip phone, the electronic device 100 may detect the opening and closing of the flip according to the magnetic sensor 180D. And then according to the opening and closing state of the leather sheath or the opening and closing state of the flip cover, the automatic unlocking of the flip cover is set.

The acceleration sensor 180E may detect the magnitude of acceleration of the electronic device 100 in various directions (typically three axes). The magnitude and direction of gravity can be detected when the electronic device 100 is stationary. The method can also be used for recognizing the posture of the electronic equipment, and is applied to horizontal and vertical screen switching, pedometers and other applications.

A distance sensor 180F for measuring a distance. The electronic device 100 may measure the distance by infrared or laser. In some embodiments, taking a picture of a scene, electronic device 100 may utilize range sensor 180F to range for fast focus.

The proximity light sensor 180G may include, for example, a Light Emitting Diode (LED) and a light detector, such as a photodiode. The light emitting diode may be an infrared light emitting diode. The electronic device 100 emits infrared light to the outside through the light emitting diode. The electronic device 100 detects infrared reflected light from a nearby object using a photodiode. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100. When insufficient reflected light is detected, the electronic device 100 may determine that there are no objects near the electronic device 100. The electronic device 100 can utilize the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear for talking, so as to automatically turn off the screen to achieve the purpose of saving power. The proximity light sensor 180G may also be used in a holster mode, a pocket mode automatically unlocks and locks the screen.

The ambient light sensor 180L is used to sense ambient light brightness. Electronic device 100 may adaptively adjust the brightness of display screen 194 based on the perceived ambient light level. The ambient light sensor 180L may also be used to automatically adjust the white balance when taking a picture. The ambient light sensor 180L may also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket to prevent accidental touches.

The fingerprint sensor 180H is used to collect a fingerprint. The electronic device 100 may utilize the collected fingerprint characteristics to unlock a fingerprint, access an application lock, photograph a fingerprint, answer an incoming call with a fingerprint, and so on.

The temperature sensor 180J is used to detect temperature. In some embodiments, electronic device 100 implements a temperature processing strategy using the temperature detected by temperature sensor 180J. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the electronic device 100 performs a reduction in performance of a processor located near the temperature sensor 180J, so as to reduce power consumption and implement thermal protection. In other embodiments, the electronic device 100 heats the battery 142 when the temperature is below another threshold to avoid abnormal shutdown of the electronic device 100 due to low temperature. In other embodiments, when the temperature is lower than a further threshold, the electronic device 100 performs a boost on the output voltage of the battery 142 to avoid abnormal shutdown due to low temperature.

The touch sensor 180K is also called a "touch device". The touch sensor 180K may be disposed on the display screen 194, and the touch sensor 180K and the display screen 194 form a touch screen, which is also called a "touch screen". The touch sensor 180K is used to detect a touch operation acting thereon or nearby. The touch sensor may communicate the detected touch operation to the application processor to determine the touch event type. Visual output associated with the touch operation may be provided via the display screen 194. In other embodiments, the touch sensor 180K may be disposed on a surface of the electronic device 100, different from the position of the display screen 194.

The bone conduction sensor 180M may acquire a vibration signal. In some embodiments, the bone conduction sensor 180M may acquire a vibration signal of the human vocal part vibrating the bone mass. The bone conduction sensor 180M may also contact the human body pulse to receive the blood pressure pulsation signal. In some embodiments, the bone conduction sensor 180M may also be disposed in a headset, integrated into a bone conduction headset. The audio module 170 may analyze a voice signal based on the vibration signal of the bone mass vibrated by the sound part acquired by the bone conduction sensor 180M, so as to implement a voice function. The application processor can analyze heart rate information based on the blood pressure beating signal acquired by the bone conduction sensor 180M, so that the heart rate detection function is realized.

The keys 190 include a power-on key, a volume key, and the like. The keys 190 may be mechanical keys. Or may be touch keys. The electronic apparatus 100 may receive a key input, and generate a key signal input related to user setting and function control of the electronic apparatus 100.

The motor 191 may generate a vibration cue. The motor 191 may be used for incoming call vibration cues, as well as for touch vibration feedback. For example, touch operations applied to different applications (e.g., photographing, audio playing, etc.) may correspond to different vibration feedback effects. The motor 191 may also respond to different vibration feedback effects for touch operations applied to different areas of the display screen 194. Different application scenes (such as time reminding, receiving information, alarm clock, game and the like) can also correspond to different vibration feedback effects. The touch vibration feedback effect may also support customization.

Indicator 192 may be an indicator light that may be used to indicate a state of charge, a change in charge, or a message, missed call, notification, etc.

The SIM card interface 195 is used to connect a SIM card. The SIM card can be brought into and out of contact with the electronic apparatus 100 by being inserted into the SIM card interface 195 or being pulled out of the SIM card interface 195. The electronic device 100 may support 1 or N SIM card interfaces, N being a positive integer greater than 1. The SIM card interface 195 may support a Nano SIM card, a Micro SIM card, a SIM card, etc. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different. The SIM card interface 195 may also be compatible with different types of SIM cards. The SIM card interface 195 may also be compatible with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as communication and data communication. In some embodiments, the electronic device 100 employs esims, namely: an embedded SIM card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100.

The software system of the electronic device 100 may employ a hierarchical architecture, an event-driven architecture, a micro-core architecture, a micro-service architecture, or a cloud architecture. The embodiment of the present invention uses an Android system with a layered architecture as an example to exemplarily illustrate a software structure of the electronic device 100.

Fig. 2 is a block diagram of a software configuration of the electronic apparatus 100 according to the embodiment of the present invention.

The layered architecture divides the software into several layers, each layer having a clear role and division of labor. The layers communicate with each other through a software interface. In some embodiments, the Android system is divided into four layers, an application layer, an application framework layer, an Android runtime (Android runtime) and system library, and a kernel layer from top to bottom.

The application layer may include a series of application packages.

As shown in fig. 2, the application package may include applications such as camera, gallery, calendar, phone call, map, navigation, WLAN, bluetooth, music, video, short message, etc.

For example, in the embodiment of the present application, the process of presenting the incoming call prompt interface, the call interface, the editing interface of the voice signal, and the like to the user may be included in the application layer.

The application framework layer provides an Application Programming Interface (API) and a programming framework for the application program of the application layer. The application framework layer includes a number of predefined functions.

For example, in the embodiment of the present application, the processes of extracting the parameter value of the speech parameter in the speech signal, synthesizing the speech signal, and the like may be included in the application framework layer.

As shown in FIG. 2, the application framework layers may include a window manager, content provider, view system, phone manager, resource manager, notification manager, and the like.

The window manager is used for managing window programs. The window manager can obtain the size of the display screen, judge whether a status bar exists, lock the screen, intercept the screen and the like.

Content providers are used to store and retrieve data and make it accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phone books, etc.

For example, in the embodiment of the present application, the content provider may store a parameter value of a preset parameter of each incoming call number, an incoming call alert tone of the incoming call number, and the like.

The view system includes visual controls such as controls to display text, controls to display pictures, and the like. The view system may be used to build applications. The display interface may be composed of one or more views. For example, the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures.

The phone manager is used to provide communication functions for the electronic device 100. Such as management of call status (including on, off, etc.).

The resource manager provides various resources for the application, such as localized strings, icons, pictures, layout files, video files, and the like.

The notification manager enables the application to display notification information in the status bar, can be used to convey notification-type messages, can disappear automatically after a short dwell, and does not require user interaction. Such as a notification manager used to notify download completion, message alerts, etc. The notification manager may also be a notification that appears in the form of a chart or scroll bar text at the top status bar of the system, such as a notification of a background running application, or a notification that appears on the screen in the form of a dialog window. For example, prompting text information in the status bar, sounding a prompt tone, vibrating the electronic device, flashing an indicator light, etc.

The Android Runtime comprises a core library and a virtual machine. The Android runtime is responsible for scheduling and managing an Android system.

The core library comprises two parts: one part is a function which needs to be called by java language, and the other part is a core library of android.

The application layer and the application framework layer run in a virtual machine. And executing java files of the application program layer and the application program framework layer into a binary file by the virtual machine. The virtual machine is used for performing the functions of object life cycle management, stack management, thread management, safety and exception management, garbage collection and the like.

The system library may include a plurality of functional modules. For example: surface managers (surface managers), media Libraries (Media Libraries), three-dimensional graphics processing Libraries (e.g., openGL ES), 2D graphics engines (e.g., SGL), and the like.

The surface manager is used to manage the display subsystem and provide fusion of 2D and 3D layers for multiple applications.

The media library supports a variety of commonly used audio, video format playback and recording, and still image files, among others. The media library may support a variety of audio-video encoding formats, such as MPEG4, h.264, MP3, AAC, AMR, JPG, PNG, and the like.

The three-dimensional graphic processing library is used for realizing three-dimensional graphic drawing, image rendering, synthesis, layer processing and the like.

The 2D graphics engine is a drawing engine for 2D drawing.

The kernel layer is a layer between hardware and software. The inner core layer at least comprises a display driver, a camera driver, an audio driver and a sensor driver.

For convenience of understanding, in the following embodiments of the present application, an electronic device with a structure shown in fig. 1 and fig. 2 is taken as an example, and with reference to the accompanying drawings and an application scenario, an incoming call prompting method provided in the embodiments of the present application is specifically described.

Fig. 3A to fig. 3F are exemplary diagrams of Graphical User Interfaces (GUIs) of an incoming call prompting method according to an embodiment of the present application, and an example in which an electronic device is a mobile phone is taken as an example in the diagrams illustrates an incoming call prompting method according to an embodiment of the present application.

As shown in fig. 3A, when a mobile phone according to the embodiment of the present application receives a call from another mobile phone (hereinafter, referred to as a call), a call prompting interface is displayed on a screen of the mobile phone, where the call prompting interface can display a call number, an answer button, and an answer rejection (reject) button, and if the call number is recorded in an address book of the mobile phone, a contact name (not shown in the figure) corresponding to the call number can also be displayed in the call prompting interface; other contents that may be displayed in the incoming call prompt interface are not limited in the embodiment of the present application. Hereinafter, a caller is referred to as a caller, and a telephone number of a mobile phone used by the caller is referred to as an incoming number.

When the incoming call prompt interface is displayed on the screen of the mobile phone, if the incoming call notification mode of the mobile phone involves voice prompt, the mobile phone plays a prompt tone corresponding to the incoming call number. It should be noted that the incoming call prompt mode of the mobile phone may include, but is not limited to: voice prompt, voice + vibration prompt, silence, and the like. In the embodiment of the application, the prompt tone played by the mobile phone has at least the following two possibilities according to whether the mobile phone receives the incoming call of the incoming call number for the first time:

if the mobile phone receives the incoming call of the incoming call number for the first time, the mobile phone plays a prompt tone set for the incoming call number of the first incoming call, and the prompt tone can be a prompt tone set by the mobile phone by default for the incoming call number of the first incoming call or a prompt tone set by a user independently according to the preference of the user and the like;

if the mobile phone does not receive the incoming call of the incoming call number for the first time, the prompt tone played by the mobile phone is the personalized prompt tone in the embodiment of the application, namely the prompt tone with the voice characteristics of the caller to which the incoming call number belongs, so that the user can distinguish the caller more easily through the prompt tone.

Referring to fig. 3B, it is assumed that the user selects the answer button to answer the incoming call, the user makes a call with the caller corresponding to the incoming call number, and after the call is ended, the user selects the hang-up button to end the call.

The operation of the user selecting the answer button or the hang-up button may include, but is not limited to: clicking, pressing the indication direction after clicking the button, and the like, which is not limited in the embodiments of the present application.

It should be noted that, the implementation manner of answering an incoming call and ending a call by a user in fig. 3B is only an example, and other possible implementation manners are not limited in the embodiment of the present application.

When a user selects a hang-up button and finishes a call, the two situations are divided into two situations according to whether the mobile phone receives the incoming call of the incoming call number for the first time:

the first condition is as follows: if the mobile phone receives the incoming call of the incoming call number for the first time, the user selects the hang-up button, after the call is finished, the mobile phone can extract the parameter value of the preset voice parameter according to the voice signal of the caller in the call process, the parameter value of the preset voice parameter can reflect the voice feature of the caller, and then voice synthesis is carried out according to the extracted parameter value to obtain the voice signal with the voice feature of the caller.

For example: supposing that the caller is used to repeat a word "double-voice" and the voice is raised when speaking, the voice signal with the voice characteristic of the caller obtained by the voice synthesis of the mobile phone may comprise the voice signal of double-voice, and the pronunciation of double-voice in the voice signal is raised according to the speaking habit of the caller, so that the voice characteristic of the caller is provided; supposing that the speaking of the caller has accent and the words with two voices have special pronunciation habits, the voice signal with the voice characteristics of the caller obtained by the voice synthesis of the mobile phone may contain the voice signal with the words with two voices, and the pronunciation of the words with two voices in the voice signal has the special pronunciation habits of the caller and has the voice characteristics of the caller; and so on.

For the voice signal obtained by voice synthesis, the mobile phone can store the voice signal as the prompt tone of the incoming call number 1234567891, and the mobile phone can play the prompt tone when the mobile phone receives the incoming call of the incoming call number 1234567891 next time.

Alternatively, the user may have the option of storing and editing the alert tone of the incoming call number, so that the user has a better user experience, as shown in fig. 3C, the mobile phone may prompt the user that the alert tone of the incoming call number 1234567891 has been generated, ask the user whether to edit the alert tone, or set the alert tone as the incoming call tone of the incoming call number 1234567891, by, for example, displaying a prompt box to the user. In the prompt box, three possible buttons as shown in fig. 3C may be set for the user to select, editing, setting, and canceling, respectively; wherein the content of the first and second substances,

if the user selects the edit button, the mobile phone can display a prompt tone edit interface on the screen for the user. In a possible implementation manner of the editing interface, the parameter value of the preset voice parameter used in synthesizing the voice signal can be displayed to the user in the editing interface, so that the user can modify the parameter value to achieve the purpose of editing the voice signal. However, it is too obscure to show the parameter value of the preset speech parameter to the user, and unless the user is a professional, it is difficult for the user to conveniently and quickly achieve the editing purpose by modifying the parameter value of the preset speech parameter, for this reason, as shown in fig. 3D, the editing interface may also show the user a more intuitive editing option instead of showing the parameter value of the preset speech parameter used when synthesizing the speech signal, such as the sound shown in fig. 3D being bold, thin, rally, and taiyang. The editing options described above correspond to different modification strategies for the parameter values of the speech parameters. For example, the edit option of sound-up may correspond to a modification strategy in which the parameter value of the baseband frequency becomes smaller and the parameter value of the subband signal strength becomes larger, and the edit option of sound-down may correspond to a modification strategy in which the parameter value of the baseband frequency becomes larger and the parameter value of the subband signal strength becomes smaller. The user selects the editing option, the mobile phone correspondingly modifies the parameter value of the voice parameter used when the voice signal is synthesized according to the modification strategy of the editing option, the voice synthesis is carried out again according to the modified parameter value of the voice parameter to obtain a new voice signal, until the user selects a determination button in the editing interface, the voice signal finally obtained by the user is set as the incoming call prompt tone of the incoming call number 1234567891, or the user selects an exit button in the editing interface, the mobile phone does not store the incoming call prompt tone of the incoming call number 1234567891, the incoming call number 1234567891 is received next time, the mobile phone still considers that the incoming call number 1234567891 is received for the first time, and the prompt tone set by the user for the incoming call number of the first incoming call is played by the mobile phone.

If the user selects the setting button in the prompt box, the mobile phone directly sets the voice signal obtained by voice synthesis as the incoming call prompt tone of the incoming call number 1234567891, and when the incoming call of the incoming call number 1234567891 is received next time, the prompt tone can be played, so that the user can easily identify the caller.

If the user selects the cancel button in the prompt box, the mobile phone does not store the incoming call prompt tone of the incoming call number 1234567891, the next incoming call of the incoming call number 1234567891 is received, the mobile phone still considers that the incoming call of the incoming call number 1234567891 is received for the first time, and the prompt tone set by the user for the incoming call number of the first incoming call is played by the mobile phone.

Case two: if the mobile phone does not receive the incoming call of the incoming call number for the first time, the mobile phone can extract and store the parameter value of the preset voice parameter according to the voice signal of the caller in the process of the conversation between the user and the caller.

As the number of incoming calls of the same incoming call number increases, the parameter values of the preset voice parameters correspondingly stored in the mobile phone are more and more for the incoming call number, the same voice parameter may be extracted for multiple times, and the more times the same parameter value of the same voice parameter is extracted, the more the parameter value can represent the voice characteristics of the caller; when the mobile phone judges that the parameter value used for synthesizing the prompt tone of the caller number before deviates from the parameter value capable of representing the voice feature of the caller, so that the voice feature of the caller cannot be well represented, the mobile phone can select the parameter value of each preset voice parameter capable of representing the voice feature of the caller from the stored parameter values of the preset voice parameters, and synthesize a new voice signal capable of representing the voice feature of the caller again by voice, at the moment:

after extracting and storing the parameter value of the preset voice parameter according to the voice signal of the caller in the call process, the mobile phone can judge whether the incoming call prompt tone of the incoming call number can represent the voice characteristics of the caller, and if so, the mobile phone finishes the incoming call prompt process of the embodiment of the application; if not, the mobile phone performs speech synthesis again to obtain a new speech signal, and presents a prompt box to the user, for example, as shown in fig. 3E, prompting the user that a new prompt tone has been generated for the caller id number 1234567891, and asking the user whether to edit the prompt tone or set the prompt tone as the caller id number 1234567891. The buttons set in the prompt box are similar to those in fig. 3C, and the processing procedure of the mobile phone when the user selects a certain button is also similar to that in fig. 3C, which is not described again here. It should be noted that, if the user selects the edit button in the prompt box, a prompt tone edit interface similar to that shown in fig. 3D may also be entered, which is not described herein again.

It should be noted that, in order to increase the user's listening experience to the voice signal with the voice characteristic of the caller, background music, preferably pure music, may be added to the voice signal when synthesizing the voice signal with the voice characteristic of the caller, so that the user can distinguish the caller based on the voice characteristic when listening to the incoming call prompt tone, and the user can not feel as sharp when just listening to the incoming call prompt tone.

It should be noted that, in practical applications, for the incoming call prompt tone of each incoming call number stored in the mobile phone, an editing interface may also be set in the mobile phone, for example, as shown in fig. 3F, and a user autonomously enters the prompt tone editing interface in fig. 3D, for example, to edit and reset the incoming call prompt tone of a certain incoming call number, which is not described herein again.

Fig. 4 is a schematic processing process diagram of a method for prompting an incoming call according to an embodiment of the present application, and an example of the electronic device is a mobile phone is described. Specifically, as shown in fig. 4, the incoming call prompting method may include the following processing procedures:

step 401: the mobile phone receives an incoming call, acquires an incoming call number, judges whether the incoming call of the incoming call number is received for the first time, if so, executes step 402, and if not, executes step 406.

The mobile phone judges whether the call receiving the incoming call number is received for the first time or not by judging whether the incoming call prompt tone of the incoming call number is stored in the mobile phone or not, if the incoming call prompt tone of the incoming call number is stored, the mobile phone judges that the call receiving the incoming call number is not received for the first time, and if the incoming call prompt tone of the incoming call number is not stored, the mobile phone judges that the call receiving the incoming call number is received for the first time. It should be noted that the incoming call prompt tone of the incoming call number does not refer to the incoming call prompt tone uniformly set by the user for the incoming call number of the first incoming call, but refers to the incoming call prompt tone synthesized for the incoming call number in the embodiment of the present application and having the voice feature of the caller to which the incoming call number belongs.

Step 402: the mobile phone plays a calling prompt tone of a first incoming call set by a user, the incoming call is connected when a connection instruction of the user is received, and the incoming call is disconnected when a voice call ending instruction of the user is received in the voice call.

The incoming call prompt tone played by the mobile phone in this step is a prompt tone set by the user for the incoming call number of the first incoming call, and is not necessarily a prompt tone generated according to the voice feature of the caller in this embodiment of the application.

The above steps 401 to 402 may correspond to fig. 3A and fig. 3B, for example, in fig. 3B, the user selects an answer button, and correspondingly, the mobile phone receives a user connection instruction, and the user selects a hang-up button, and correspondingly, the mobile phone receives a user voice call end instruction.

Step 403: the mobile phone extracts and stores the parameter value of the preset voice parameter according to the voice signal of the caller in the voice communication process between the user and the caller.

In this step, the voice signal of the caller during the voice communication is also: and voice signals of other communication parties except the user of the mobile phone in the voice communication process.

Referring to the foregoing description, the parameter value of the preset voice parameter may characterize the voice characteristics of the caller. The speech parameters may include, but are not limited to: voiced flag bit, speech period, linear prediction frequency, sub-band signal strength, signal gain, and/or baseband period of the speech signal, etc.

The mobile phone can sample the voice signal of the caller in order to reduce the data processing amount of the mobile phone under the condition that the voice call duration is long, and extract the parameter value of the preset voice parameter according to the sampled voice signal (hereinafter referred to as a sampling signal).

The embodiment of the present application is not limited to the rule for sampling the voice signal of the caller, and the number of sampling times and the time length of the sampling signal sampled each time are not limited to the embodiment of the present application.

Optionally, the target total duration of the sampling signal may be preset, and the specific value of the target total duration is not limited in the embodiment of the present application, for example, a suitable value may be selected on the principle that a parameter value of a preset voice parameter can be extracted and an excessive data processing pressure is not added to the mobile phone. Since the voice call duration between the user and the caller has randomness, although the target total duration is set, the case where the total duration of the finally obtained sampling signal is smaller than the target total duration is not excluded.

In a possible implementation manner, in order to make the total duration of the sampling signal reach or approach the target total duration as much as possible, sampling may be performed only once, for example, sampling may be performed from the start time of the voice call between the user and the caller (i.e., the time when the mobile phone is connected to the incoming call) until the duration of the sampling signal reaches the target total duration or the voice call is ended, so as to obtain the sampling signal.

In another possible implementation manner, in order to take into account the characteristics and interest of the target total duration and the caller's language in the sampled signal, the voice signal of the caller during the voice call may also be selectively sampled for multiple times, the obtained total duration of the sampled signal reaches the target total duration or the voice call is ended, and the sampling of the voice signal of the caller is ended. For example, a voice signal with a signal strength exceeding a preset strength threshold value in the voice signal of the caller may be sampled each time; alternatively, a speech signal corresponding to a key utterance or an interesting utterance, which may include but is not limited to: words with a high repetition rate, words with a preset vocabulary, and the like. The predetermined words may include, but are not limited to: place name, article name, special vocabulary, etc. Optionally, the preset vocabulary may be set autonomously by a user, for example, providing a setting interface for the user, and the embodiment of the present application is not limited in particular. The special words may be some interesting words that the user likes, such as network popular words.

It should be noted that the user may preset the words to be sampled, or may preset words not to be sampled, such as preset names of people, and/or words or sentences that some users find to be disliked or play in public places, which may cause embarrassment or reveal privacy, etc.

If the voice signal of the caller is sampled for multiple times, the sampled signals sampled for multiple times can be spliced to obtain the spliced sampled signals, and the mobile phone can extract the parameter values of the preset voice parameters according to the spliced sampled signals.

The embodiment of the present application is not limited to the method for the mobile phone to extract the parameter value of the preset voice parameter according to the voice signal, for example, the spliced sampling signal.

When the extracted parameter value is stored, the mobile phone can store the parameter value based on the incoming call number. Specifically, for each incoming call number, the parameter value of the preset voice parameter may be stored in a manner as shown in table 1 below. In this step, the incoming call of the incoming call number is received for the first time, so that there is only one parameter under each voice parameter in table 1.

TABLE 1

The weight can be set for each parameter value of each voice parameter, and the value of the weight is in direct proportion to the occurrence frequency of the parameter value. Based on the foregoing description and with reference to subsequent step 406, a parameter value of each speech parameter may be extracted after each incoming call number, and for each speech parameter, the extracted parameter value may be the same as or different from the extracted parameter value of each previous speech parameter. Along with the increasing number of calls made by the caller number, the number of times of appearance of a certain extracted parameter value of the same voice parameter may be increased, correspondingly, the weight of the parameter value is increased, the more times of extraction of a parameter value of the same voice parameter, the higher the weight is, the more the parameter value can represent the voice characteristics of the caller.

Step 404: and the mobile phone performs voice synthesis according to the parameter values extracted in the step 403 to obtain a first voice signal.

The first voice signal obtained in this step is a voice signal capable of representing the voice characteristics of the caller.

The parameter value according to which the speech synthesis is performed in this step is the parameter value extracted in step 403.

The embodiment of the present application is not limited to the method for performing speech synthesis by a mobile phone according to the parameter value of the speech parameter.

Step 405: and the mobile phone sets the incoming call prompt tone of the incoming call number according to the first voice signal, and the branching process is finished.

In a possible implementation manner, the mobile phone may directly set the first voice signal as an incoming call prompt tone of the incoming call number.

In another possible implementation manner, the mobile phone may provide an editing interface for the first voice signal for the user, so that the user can edit the first voice signal, which may specifically refer to corresponding descriptions in fig. 3C and fig. 3D, which is not described herein again.

Based on the above processing, the mobile phone will record the incoming call prompt tone of the incoming call number, and when the incoming call number next calls, the mobile phone determines that the incoming call of the incoming call number is not received for the first time, and will execute step 406 and the following steps.

Step 406: the mobile phone plays the incoming call prompt tone of the incoming call number, the incoming call is connected when the connection indication of the user is received, and the incoming call is disconnected when the voice call ending indication of the user is received.

In this step, the incoming call prompt tone played by the mobile phone is a voice signal synthesized in the embodiment of the present application and having the voice feature of the caller to which the incoming call number belongs, and the user can easily identify the caller according to the voice feature of the caller in the incoming call prompt tone.

Step 407: the mobile phone extracts and stores parameter values of preset voice parameters according to voice signals of a caller in the voice communication process between a user and the caller.

In this step, for the specific implementation of the mobile phone extracting the parameter value of the preset voice parameter according to the voice signal of the caller during the voice communication between the user and the caller, reference may be made to the corresponding description in step 403, which is not described herein again.

The implementation of storing parameter values is explained below.

As the number of times of incoming calls of the same caller id increases, for the same caller id, the voice characteristics of the caller to which the caller id belongs, which are stored in the mobile phone, may be as shown in table 2 below.

TABLE 2

As can be seen from comparing table 1 and table 2, as the number of incoming calls of the same caller number increases, the extracted parameter value of the same voice parameter increases, and for the same voice parameter, the extracted parameter value may be the same or different each time, but since the parameter value represents the voice feature of the caller, and the voice feature of the caller is relatively fixed and is not easy to change, the following features are generally presented finally: for the same incoming call number, the frequency of occurrence of a certain parameter value of the same voice parameter gradually increases along with the increase of the incoming call frequency of the incoming call number, and correspondingly, the weight of the parameter value also increases. The more the weight of a parameter value of the same voice parameter is, the more the parameter value can represent the voice characteristics of the caller.

Step 408: the mobile phone judges whether a prompt tone updating condition is met according to the stored parameter value of each preset voice parameter corresponding to the incoming call number, and if the prompt tone updating condition is met, voice synthesis is carried out according to the parameter value with the maximum weight of the preset voice parameter corresponding to the incoming call number to obtain a second voice signal.

If the judgment in this step does not satisfy the condition for updating the warning tone, this branching process ends, which is not described herein again.

With the above table 1 and table 2, as the number of incoming calls of the same incoming call number increases, the parameter value of each voice parameter according to which the incoming call alert tone of the incoming call number has been set in the mobile phone when being synthesized may no longer be the parameter value with the largest weight, so that the incoming call alert tone of the incoming call number has been set in the mobile phone may not best represent the voice feature of the caller. Based on the above description, the condition for updating the alert tone in this step may include, but is not limited to: at least one parameter value according to which the incoming call prompt tone of the incoming call number is set in the mobile phone when being synthesized is no longer the parameter value with the maximum weight of the voice parameter.

For example, in step 404, a first speech signal is synthesized according to the parameter values A1, B1, C1, D1, E1, and F1 in table 1, but it is obvious that A1 and F1 in table 2 are no longer the parameter values with the largest weight, and cannot better characterize the speech characteristics of the caller, so that in step 409, a second speech signal can be synthesized again according to A3, B1, C1, D1, E1, and F3.

Step 409: and the mobile phone sets the incoming call prompt tone of the incoming call number according to the second voice signal, and the branching process is finished.

The implementation of this step may refer to the corresponding description in step 405, which is not described herein again.

This step may correspond to fig. 3E.

It should be noted that, unlike the above-mentioned method for prompting an incoming call in the present embodiment, in the method for prompting an incoming call in the present embodiment, the parameter value of the preset voice parameter may be extracted according to the voice signal of the caller when the incoming call is received by the incoming call number, for example, the parameter value of the preset voice parameter may be extracted at intervals, that is, the parameter value of the preset voice parameter is extracted according to the voice signal of the caller when the incoming call number is received for the first time, the parameter value of the preset voice parameter is extracted according to the voice signal of the caller when the incoming call number is received for the third time, and so on.

It should be noted that, based on the above description in the embodiment of the present application, as the number of incoming calls of the same incoming call number increases, the extracted parameter values of the same voice parameter increase, and finally the following characteristics are generally presented: for the same voice parameter of the same incoming call number, the frequency of a certain parameter value gradually increases along with the increase of the incoming call frequency of the incoming call number, becomes the parameter value with the maximum weight, and is always kept as the parameter value with the maximum weight; based on this, a frequency threshold value can be set in the mobile phone, when the frequency of extracting the parameter value of the preset voice parameter according to the voice signal of the caller to which the incoming call number belongs by the mobile phone exceeds the frequency threshold value, the mobile phone receives the incoming call of the incoming call number, and the mobile phone does not extract the parameter value of the preset voice parameter according to the voice signal of the caller to which the incoming call number belongs. The above-mentioned value of the number threshold is not limited in this embodiment, but, relatively speaking, the larger the number threshold is, the more the incoming call prompt tone corresponding to the incoming call number can represent the voice characteristics of the caller to which the incoming call number belongs.

According to the incoming call prompting method, different incoming call prompting tones are set for different incoming call numbers, and the incoming call prompting tones have the voice characteristics of the caller to which the incoming call number belongs, so that the incoming call prompting tones have individuation, and different individualized prompting tones can be used for different incoming call numbers; the incoming call prompt tone of the embodiment of the application has the voice characteristics of the caller to which the incoming call number belongs, so that a user can distinguish the caller relatively easily through the voice characteristics of the incoming call prompt tone played during incoming call.

Moreover, when the parameter value of the voice parameter representing the voice characteristic of the caller is extracted, the semantic meaning of the voice signal of the caller can not be analyzed, so that the privacy of a user is not involved; the voice characteristics represented by the preset voice parameters mainly include the voice speed and rhythm of the caller and statement information set by some users, so that the voice characteristics of the caller are kept, and the safety leakage of personal information caused by playing the incoming call prompt tone can be avoided.

With reference to the foregoing embodiments and the related drawings, the embodiments of the present application provide an incoming call prompting method, which can be implemented in an electronic device (e.g., a mobile phone, a car phone, etc.) with a voice call function as shown in fig. 1 and fig. 2. Fig. 5 is a flowchart of an incoming call prompting method provided in an embodiment of the present application, and as shown in fig. 5, the method may include the following steps:

step 501: and acquiring voice signals of other parties except the user of the electronic equipment in the voice call process.

Here, the caller may be the caller as described in the above embodiments.

Step 502: extracting parameter values of preset voice parameters from the voice signals; performing voice synthesis according to the extracted parameter value of the preset voice parameter to obtain a first voice signal; and setting an incoming call prompt tone of the incoming call number of the calling party according to the first voice signal.

Step 503: and receiving the incoming call of the incoming call number, determining that the incoming call prompt mode comprises voice prompt, and playing the incoming call prompt tone of the incoming call number.

Optionally, the method may further include:

if yes, executing the step of performing voice synthesis according to the extracted parameter value of the preset voice parameter in the step 502;

Optionally, before performing voice synthesis according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number, the method further includes:

Optionally, performing speech synthesis according to the parameter value of the preset speech parameter extracted from the speech signal and the parameter value of the preset speech parameter corresponding to the incoming call number, may include:

Optionally, the setting of the incoming call prompt tone of the incoming call number according to the second voice signal may include:

The editing interface of the second voice signal may refer to the editing interface illustrated in fig. 3D.

Optionally, the step 502 of setting an incoming call prompt tone of the incoming call number according to the first voice signal may include:

setting the first voice signal as an incoming call prompt tone of an incoming call number; alternatively, the first and second liquid crystal display panels may be,

displaying an editing interface of the first voice signal for a user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a fourth voice signal; and when receiving a setting confirmation operation of the user in the editing interface, setting the fourth voice signal as an incoming call prompt tone of the incoming call number.

Wherein the editing interface of the first voice signal may refer to the editing interface illustrated in fig. 3D.

Optionally, the extracting a parameter value of the preset speech parameter from the speech signal in step 502 may include:

Optionally, the sampling the voice signal according to the preset sampling rule may include:

sampling a voice signal from the beginning of the voice call until the time length of the sampled voice signal reaches a preset target total time length or the voice call is ended; alternatively, the first and second liquid crystal display panels may be,

and sampling the voice signals with preset vocabularies in the voice signals until the total time length of the sampling signals reaches a preset target total time length or the voice call is ended.

The implementation of each step in fig. 5 may refer to the corresponding description in the embodiments shown in fig. 3A to fig. 4, and is not described again here.

It is to be understood that some or all of the steps or operations in the above-described embodiments are merely examples, and other operations or variations of various operations may be performed by the embodiments of the present application. Further, the various steps may be performed in a different order presented in the above embodiments, and not all of the operations in the above embodiments may be performed.

Fig. 6 is a schematic structural diagram of an embodiment of the incoming call prompt device of the present application, and as shown in fig. 6, the device 60 may include:

an acquisition unit 61 configured to acquire a voice signal of a calling party other than a user of the electronic device during a voice call;

a prompt tone setting unit 62 for extracting a parameter value of a preset voice parameter from the voice signal; performing voice synthesis according to the extracted parameter value of the preset voice parameter to obtain a first voice signal; setting an incoming call prompt tone of the incoming call number of the calling party according to the first voice signal;

and an alert tone playing unit 63, configured to receive an incoming call of the incoming call number, determine that the incoming call alert mode includes a voice alert, and play an incoming call alert tone of the incoming call number.

Optionally, the method may further include:

the judging unit is used for judging whether an incoming call of the incoming call number of a calling party is received for the first time;

the alert tone setting unit 62 may be specifically configured to: if the judgment result of the judgment unit is yes, carrying out voice synthesis according to the extracted parameter value of the preset voice parameter; if not, performing voice synthesis according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number to obtain a second voice signal; the parameter value of the preset voice parameter corresponding to the incoming call number is extracted from the voice signal of the calling party in at least one voice communication process before the voice communication process, and the incoming call prompt tone of the incoming call number is set according to the second voice signal.

Optionally, the alert tone setting unit 62 may be further configured to: and judging whether the incoming call prompt tone updating condition of the incoming call number is met according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number.

Optionally, the alert tone setting unit 62 may be specifically configured to: updating the parameter value of the preset voice parameter corresponding to the incoming call number by using the parameter value of the preset voice parameter extracted from the voice signal; and carrying out voice synthesis according to the parameter value with the maximum weight of each preset voice parameter corresponding to the incoming call number.

Optionally, the prompt tone setting unit 62 may specifically be configured to: setting the second voice signal as an incoming call prompt tone of the incoming call number; or displaying an editing interface of the second voice signal for the user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a third voice signal; and when receiving a setting confirmation operation of the user in the editing interface, setting the third voice signal as an incoming call prompt tone of the incoming call number.

Optionally, the alert tone setting unit 62 may be specifically configured to: setting the first voice signal as an incoming call prompt tone of an incoming call number; or displaying an editing interface of the first voice signal for the user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a fourth voice signal; and when receiving the setting confirmation operation of the user in the editing interface, setting the fourth voice signal as the incoming call prompt tone of the incoming call number.

The editing interface of the first voice signal may refer to the editing interface illustrated in fig. 3D.

Optionally, the alert tone setting unit 62 may be specifically configured to: sampling a voice signal according to a preset sampling rule to obtain a sampling signal; and extracting the parameter value of the preset voice parameter according to the sampling signal.

Optionally, the alert tone setting unit 62 may be specifically configured to: starting to sample the voice signal from the call starting time of the voice call until the time length of the sampled voice signal reaches the preset target total time length or the voice call is ended; alternatively, the first and second electrodes may be,

sampling voice signals of which the signal intensity exceeds a preset intensity threshold value in the voice signals until the total duration of the sampled signals reaches a preset target total duration or the voice call is ended; or sampling the voice signals with preset vocabularies in the voice signals until the total time length of the sampling signals reaches the preset target total time length or the voice call is ended.

The apparatus provided in the embodiment shown in fig. 6 may be used to implement the technical solutions of the method embodiments shown in fig. 3A to fig. 5 in this application, and the implementation principles and technical effects of the apparatus may further refer to the related descriptions in the method embodiments.

It should be understood that the above division of the units of the apparatus shown in fig. 6 is only a logical division, and the actual implementation may be wholly or partially integrated into one physical entity, or may be physically separated. And these units can be implemented entirely in software, invoked by a processing element; or may be implemented entirely in hardware; part of the units can also be realized in the form of software called by a processing element, and part of the units can be realized in the form of hardware. For example, the obtaining unit may be a processing element that is separately set up, or may be integrated into a chip of the electronic device. The other units are implemented similarly. In addition, all or part of the units can be integrated together or can be independently realized. In implementation, the steps of the method or the units above may be implemented by hardware integrated logic circuits in a processor element or instructions in software.

For example, the above units may be one or more integrated circuits configured to implement the above methods, such as: one or more Application Specific Integrated Circuits (ASICs), or one or more microprocessors (DSPs), or one or more Field Programmable Gate Arrays (FPGAs), among others. For another example, these units may be integrated together and implemented in the form of a System-On-a-Chip (SOC).

An embodiment of the present application further provides an electronic device, including: one or more processors; a memory; and one or more computer programs, wherein the one or more computer programs are stored in the memory, the one or more computer programs comprising instructions, which when executed by the apparatus, cause the apparatus to perform the steps of:

extracting parameter values of preset voice parameters from the voice signals; performing voice synthesis according to the extracted parameter value of the preset voice parameter to obtain a first voice signal; setting an incoming call prompt tone of an incoming call number of a calling party according to the first voice signal;

Optionally, the instructions, when executed by the device, cause the device to further perform the steps of:

and setting an incoming call prompt tone of the incoming call number according to the second voice signal.

Optionally, before the step of performing, by the apparatus, speech synthesis according to the parameter value of the preset speech parameter extracted from the speech signal and the parameter value of the preset speech parameter corresponding to the incoming call number, the instruction may further include:

Optionally, when the instruction is executed by the apparatus, the apparatus may perform a step of performing speech synthesis according to a parameter value of a preset speech parameter extracted from the speech signal and a parameter value of a preset speech parameter corresponding to the incoming call number, where the step may include:

Alternatively, the instructions, when executed by the apparatus, cause the apparatus to perform the step of setting an incoming call alert tone of an incoming call number according to the second voice signal, may include:

setting the second voice signal as an incoming call prompt tone of the incoming call number; alternatively, the first and second liquid crystal display panels may be,

Alternatively, the instructions, when executed by the apparatus, cause the apparatus to perform the step of setting an incoming call alert tone of an incoming call number according to the first voice signal, may include:

Alternatively, the instructions, when executed by the apparatus, cause the apparatus to perform the step of extracting a parameter value of the preset speech parameter from the speech signal, may include:

Optionally, the instructions, when executed by the apparatus, cause the apparatus to perform the step of sampling the speech signal according to a preset sampling rule, may include:

starting to sample the voice signal from the call starting time of the voice call until the time length of the sampled voice signal reaches the preset target total time length or the voice call is ended; alternatively, the first and second liquid crystal display panels may be,

sampling voice signals of which the signal intensity exceeds a preset intensity threshold value in the voice signals until the total duration of the sampled signals reaches a preset target total duration or the voice call is ended; alternatively, the first and second liquid crystal display panels may be,

Embodiments of the present application further provide a computer-readable storage medium, in which a computer program is stored, and when the computer program runs on a computer, the computer is caused to execute the method provided in the embodiments shown in fig. 3A to 5 of the present application.

Embodiments of the present application further provide a computer program product, which includes a computer program and when the computer program runs on a computer, the computer is caused to execute the method provided by the embodiments shown in fig. 3A to fig. 5 of the present application.

In the embodiments of the present application, "at least one" means one or more, and "a plurality" means two or more. "and/or" describes the association relationship of the associated objects, and means that there may be three relationships, for example, a and/or B, and may mean that a exists alone, a and B exist simultaneously, and B exists alone. Wherein A and B can be singular or plural. The character "/" generally indicates that the former and latter associated objects are in an "or" relationship. "at least one of the following" and the like, refer to any combination of these items, including any combination of singular or plural items. For example, at least one of a, b, and c may represent: a, b, c, a and b, a and c, b and c or a and b and c, wherein a, b and c can be single or multiple.

Those of ordinary skill in the art will appreciate that the various elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of electronic hardware and computer software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the technical solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

In the several embodiments provided in the present application, any function, if implemented in the form of a software functional unit and sold or used as a separate product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application or portions thereof that substantially contribute to the prior art may be embodied in the form of a software product stored in a storage medium and including instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

The above description is only an embodiment of the present application, and any person skilled in the art can easily conceive of changes or substitutions within the technical scope of the present disclosure, and all of them should be covered by the protection scope of the present application. The protection scope of the present application shall be subject to the protection scope of the claims.

Claims

1. A method for prompting an incoming call is applied to electronic equipment with a conversation function, and is characterized by comprising the following steps:

acquiring voice signals of calling parties except a user of the electronic equipment in the voice calling process of a first incoming call;

extracting parameter values of preset voice parameters from the voice signals;

when an incoming call of the incoming call number of the calling party is received for the first time, carrying out voice synthesis according to the extracted parameter value of the preset voice parameter to obtain a first voice signal; setting an incoming call prompt tone of the incoming call number of the calling party according to the first voice signal; the preset voice parameters include: a voiced flag bit, a speech period, a linear prediction frequency, a sub-band signal strength, a signal gain, and/or a baseband period of the speech signal;

when an incoming call of the incoming call number of the calling party is not received for the first time, performing voice synthesis according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number to obtain a second voice signal; extracting a parameter value of a preset voice parameter corresponding to the incoming call number from a voice signal of the calling party in at least one voice call process before the voice call process; setting an incoming call prompt tone of the incoming call number according to the second voice signal;

receiving a second incoming call of the incoming call number, determining that an incoming call prompt mode comprises a voice prompt, and playing an incoming call prompt tone of the incoming call number; the second incoming call is the next incoming call of the first incoming call;

the voice synthesis according to the parameter value of the preset voice parameter extracted from the voice signal and the parameter value of the preset voice parameter corresponding to the incoming call number includes:

updating the parameter value of a preset voice parameter corresponding to the incoming call number and the weight of each parameter value by using the parameter value of the preset voice parameter extracted from the voice signal; the weight of the parameter value is in direct proportion to the occurrence frequency of the parameter value;

2. The method according to claim 1, wherein before performing speech synthesis according to the parameter value of the preset speech parameter extracted from the speech signal and the parameter value of the preset speech parameter corresponding to the incoming call number, the method further comprises:

3. The method according to claim 1, wherein said setting an incoming call prompt tone of the incoming call number according to the second voice signal comprises:

displaying an editing interface of the second voice signal for the user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a third voice signal; and when receiving a setting confirmation operation of the user in the editing interface, setting the third voice signal as an incoming call prompt tone of the incoming call number.

4. The method according to any one of claims 1 to 3, wherein the setting of the incoming call prompt tone of the incoming call number according to the first voice signal comprises:

setting the first voice signal as an incoming call prompt tone of the incoming call number; alternatively, the first and second liquid crystal display panels may be,

displaying an editing interface of the first voice signal for the user, and carrying out voice synthesis again according to the editing operation of the user in the editing interface to obtain a fourth voice signal; and when receiving a setting confirmation operation of the user in the editing interface, setting the fourth voice signal as an incoming call prompt tone of the incoming call number.

5. The method according to any one of claims 1 to 3, wherein the extracting the parameter value of the preset voice parameter from the voice signal comprises:

sampling the voice signal according to a preset sampling rule to obtain a sampling signal;

and extracting the parameter value of a preset voice parameter according to the sampling signal.

6. The method of claim 5, wherein sampling the speech signal according to a preset sampling rule comprises:

sampling the voice signal from the starting time of the voice call until the time length of the sampled voice signal reaches a preset target total time length or the voice call is finished; alternatively, the first and second electrodes may be,

and sampling the voice signals with preset words in the voice signals until the total time length of the sampled signals reaches a preset target total time length or the voice call is ended.

7. An electronic device, comprising:

one or more processors; a memory; and one or more computer programs, wherein the one or more computer programs are stored in the memory, the one or more computer programs comprising instructions which, when executed by the apparatus, cause the apparatus to perform the method of any of claims 1 to 6.

8. A computer-readable storage medium, in which a computer program is stored which, when run on a computer, causes the computer to carry out the method according to any one of claims 1 to 6.