CN116405594B - Voice communication method and device - Google Patents

Voice communication method and device Download PDF

Info

Publication number
CN116405594B
CN116405594B CN202310671244.1A CN202310671244A CN116405594B CN 116405594 B CN116405594 B CN 116405594B CN 202310671244 A CN202310671244 A CN 202310671244A CN 116405594 B CN116405594 B CN 116405594B
Authority
CN
China
Prior art keywords
called
rtp packet
calling
equipment
color ring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310671244.1A
Other languages
Chinese (zh)
Other versions
CN116405594A (en
Inventor
付海刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Honor Device Co Ltd
Original Assignee
Honor Device Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Honor Device Co Ltd filed Critical Honor Device Co Ltd
Priority to CN202310671244.1A priority Critical patent/CN116405594B/en
Publication of CN116405594A publication Critical patent/CN116405594A/en
Application granted granted Critical
Publication of CN116405594B publication Critical patent/CN116405594B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42017Customized ring-back tones

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Environmental & Geological Engineering (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephone Function (AREA)

Abstract

The application provides a voice call method and a voice call device, which can solve single-pass faults between two parties of a voice call as far as possible, save the voice call as far as possible, enable the voice call to be normally carried out, and improve call experience of the two parties of the voice call. The method comprises the following steps: the calling device sends an invitation request of voice call to the called device, receives ringing notification from the called device, judges whether the calling device is in a call abnormal scene currently based on whether the called device has color ring service and the source of the received RTP packet, if so, the calling device sends a confirmation message to the called device, and adjusts the local call state to be connected.

Description

Voice communication method and device
Technical Field
The present application relates to the field of communications, and in particular, to a method and apparatus for voice call.
Background
The voice call is a basic function of the terminal device, and the calling device and the called device may be any devices having a voice call function, such as a mobile phone. Before the voice call is carried out, the calling device and the called device can establish communication connection through a network to complete a signaling interaction flow before the call so as to ensure the follow-up normal call of the calling party and the called party.
In a call anomaly scenario, a calling party initiates a call invitation to a called party, after the called party connects the call invitation of the calling party, the called equipment displays that the call is connected, but the called party cannot hear the sound of the calling party, and the calling party still displays that the call is in progress, but the calling party can hear the sound of the called party, thus the phenomenon of voice call single-pass is generated, and the experience of a user is poor.
Disclosure of Invention
The application provides a voice call method and a voice call device, which can solve single-pass faults between two parties of a voice call as far as possible, save the voice call as far as possible, enable the voice call to be normally carried out, and improve call experience of the two parties of the voice call.
In a first aspect, a voice call method is provided, including: the calling device sends an invitation request of voice call to the called device; the calling device receives a ringing notification from the called device, wherein the ringing notification is used for indicating that the called device rings; the calling device sends a confirmation message to the called device and adjusts the local call state to be on when any one of the following conditions is met: the called equipment does not have color ring service, and the calling equipment receives a first downlink real-time transmission protocol RTP packet after the ringing notification; or the called equipment has audio color ring service, the calling equipment receives the first downlink RTP packet after the ringing notification, and the sources of the first downlink RTP packet and the second downlink RTP packet are different, wherein the second downlink RTP packet is the downlink RTP packet received by the calling equipment before the ringing notification; or the called device has video color ring service, the calling device receives the first downlink RTP packet after the ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow.
In the voice call method of the embodiment of the application, in the voice call establishment stage, the calling device sends the invitation request of the voice call to the called device, after receiving the ringing notification sent by the called device, the calling device can judge the abnormal call scene by combining all or part of information in the existence condition of the color ring service, the existence type of the color ring service and the source of the downlink RTP packet, and under the condition of recognizing the abnormal passing scene, the calling device sends the confirmation message to the called device and adjusts the local call state to be on, thus solving single-pass faults between two voice call parties as far as possible and saving the voice call as far as possible, so that the voice call can be normally carried out and the call experience of the two voice call parties is improved.
After the ringing notification, the called user connects to the call, and the called device sends voice data to the calling device. If the called equipment has the color ring service, after the ringing notification, before the called user connects the call, the color ring server can send color ring data to the calling equipment, and after the called user connects the call, the called equipment sends voice data to the calling equipment. Therefore, the above-mentioned abnormal call scenario is mainly judged by identifying whether the data packet (i.e. the above-mentioned first downlink RTP packet) received by the calling device after the ring notification is the color ring data or the voice data of the called device, i.e. identifying whether the source of the data packet is the color ring server or the called device. If the source of the data packet is the color ring server, it is possible that the called party does not connect the call, and the called equipment is still in the ringing state, the calling equipment can determine that the call is not the abnormal call scene at present, if the source of the data packet is the called equipment, it is indicated that the called party has connected the call, and the calling equipment can determine that the call is in the abnormal call scene at present.
With reference to the first aspect, in certain implementation manners of the first aspect, before the calling device sends an acknowledgement message to the called device, the method further includes: the calling device sends an update request to the called device, wherein the update request is used for requesting media stream change; the calling device receives an update response from the called device; and the calling device judges whether the called device has the color ring service or not based on the updating response.
The update request may be, for example, an update message, and the update response may be, for example, an update 200 message.
With reference to the first aspect, in some implementation manners of the first aspect, the determining, by the calling device, whether the called device has a color ring service based on the update response includes: the calling device judges whether the updating response comprises a first field and/or a second field, wherein the first field is used for indicating an audio color ring service, and the second field is used for indicating a video color ring service; and if the updating response does not include the first field and the second field, the calling device determines that the called device has no CRBT service.
With reference to the first aspect, in certain implementation manners of the first aspect, the method further includes: if the update response includes the first field and does not include the second field, the calling device determines that the called device has the audio color ring service.
With reference to the first aspect, in certain implementation manners of the first aspect, the method further includes: if the update response includes the first field and the second field, the calling device determines that the called device has the video color ring service.
By the method, whether the called equipment has the CRBT service or not and the corresponding CRBT service type when the called equipment has the CRBT service can be simply determined.
With reference to the first aspect, in some implementation manners of the first aspect, before the calling device determines, based on the update response, whether the called device has a color ring service, the method further includes: the calling device judges whether a successful response of the invitation request is received or not; the calling device judges whether the called device has the color ring back tone service based on the updating response, and the method comprises the following steps: and if the calling equipment does not receive the successful response of the invitation request, the calling equipment judges whether the called equipment has the color ring back tone service or not based on the updating response.
In the embodiment of the application, if the calling device receives the successful response of the invitation request, the calling device can reply the confirmation message to the called device, and the call is connected without subsequent judgment. When the calling device does not receive the successful response of the invitation request, the abnormal call scene may appear, and the calling device may determine the abnormal call scene, and determine whether the called device has the color ring back tone service.
The successful reply to the invite request may be, for example, an invite 200 message.
Thus, the abnormal call scene can be more accurately identified, unnecessary judgment is avoided, and the establishment efficiency of the voice call is improved.
With reference to the first aspect, in certain implementation manners of the first aspect, before the calling device receives a ring notification from the called device, the method further includes: and under the condition that the called equipment has the color ring service, the calling equipment receives the second downlink RTP packet from the color ring server.
With reference to the first aspect, in certain implementation manners of the first aspect, before the calling device sends an acknowledgement message to the called device, the method further includes: under the condition that the called equipment has the color ring service, the calling equipment compares whether the synchronous source SSRC of the first downlink RTP packet and the SSRC of the second downlink RTP packet are the same or not; and if the SSRC of the first downlink RTP packet is different from the SSRC of the second downlink RTP packet, the calling equipment determines that the sources of the first downlink RTP packet and the second downlink RTP packet are different.
In RTP, SSRC is used to represent the source from which the RTP packets are generated, i.e., the source of the RTP packets. The SSRC of the first downstream RTP packet is the same as the SSRC of the second downstream RTP packet, which indicates that the source of the first downstream RTP packet is the same as the source of the second downstream RTP packet; otherwise, if the SSRC of the first downstream RTP packet is different from the SSRC of the second downstream RTP packet, it means that the source of the first downstream RTP packet is different from the source of the second downstream RTP packet.
Therefore, the embodiment of the application determines that the current abnormal call scene is in through the fact that the calling equipment recognizes that the SSRC of the first downlink RTP packet and the SSRC of the second downlink RTP packet are different, and the calling equipment actively replies the confirmation message to the called equipment, so that the call is attempted to be saved.
In a second aspect, a voice call apparatus is provided for performing the method in any one of the possible implementation manners of the first aspect. In particular, the apparatus comprises means for performing the method in any one of the possible implementations of the first aspect described above.
In a third aspect, the present application provides a further voice telephony device comprising a processor coupled to a memory operable to execute instructions in the memory to implement a method as in any of the possible implementations of the first aspect. Optionally, the communication device further comprises a memory. Optionally, the communication authentication device further comprises a communication interface, and the processor is coupled to the communication interface.
In one implementation, the voice call apparatus is a terminal device. When the voice call apparatus is a terminal device, the communication interface may be a transceiver, or an input/output interface.
In another implementation, the voice call apparatus is a chip configured in the terminal device. When the voice call apparatus is a chip configured in a terminal device, the communication interface may be an input/output interface.
In a fourth aspect, there is provided a processor comprising: input circuit, output circuit and processing circuit. The processing circuit is configured to receive a signal via the input circuit and transmit a signal via the output circuit, such that the processor performs the method of any one of the possible implementations of the first aspect.
In a specific implementation process, the processor may be a chip, the input circuit may be an input pin, the output circuit may be an output pin, and the processing circuit may be a transistor, a gate circuit, a trigger, various logic circuits, and the like. The input signal received by the input circuit may be received and input by, for example and without limitation, a receiver, the output signal may be output by, for example and without limitation, a transmitter and transmitted by a transmitter, and the input circuit and the output circuit may be the same circuit, which functions as the input circuit and the output circuit, respectively, at different times. The embodiment of the application does not limit the specific implementation modes of the processor and various circuits.
In a fifth aspect, a processing device is provided that includes a processor and a memory. The processor is configured to read instructions stored in the memory and to receive signals via the receiver and to transmit signals via the transmitter to perform the method of any one of the possible implementations of the first aspect.
Optionally, the processor is one or more, and the memory is one or more.
Alternatively, the memory may be integrated with the processor or the memory may be separate from the processor.
In a specific implementation process, the memory may be a non-transient (non-transitory) memory, for example, a Read Only Memory (ROM), which may be integrated on the same chip as the processor, or may be separately disposed on different chips.
It should be appreciated that the related data interaction process, for example, transmitting the indication information, may be a process of outputting the indication information from the processor, and the receiving the capability information may be a process of receiving the input capability information by the processor. Specifically, the data output by the processing may be output to the transmitter, and the input data received by the processor may be from the receiver. Wherein the transmitter and receiver may be collectively referred to as a transceiver.
The processing means in the fifth aspect may be a chip, and the processor may be implemented by hardware or by software, and when implemented by hardware, the processor may be a logic circuit, an integrated circuit, or the like; when implemented in software, the processor may be a general-purpose processor, implemented by reading software code stored in a memory, which may be integrated in the processor, or may reside outside the processor, and exist separately.
In a sixth aspect, there is provided a computer program product comprising: a computer program (which may also be referred to as code, or instructions) which, when executed, causes a computer to perform the method of any one of the possible implementations of the first aspect.
In a seventh aspect, a computer readable storage medium is provided, which stores a computer program (which may also be referred to as code, or instructions) which, when run on a computer, causes the computer to perform the method of any one of the possible implementations of the first aspect.
Drawings
Fig. 1 is a schematic structural diagram of a terminal device according to an embodiment of the present application;
Fig. 2 is a software configuration block diagram of a terminal device according to an embodiment of the present application;
FIG. 3 is a schematic flow chart of a voice call setup procedure;
fig. 4 is a schematic flow chart of a voice call method provided by an embodiment of the present application;
FIG. 5 is a schematic flow chart of another voice call method provided by an embodiment of the present application;
FIG. 6 is a schematic flow chart of yet another voice call method provided by an embodiment of the present application;
FIG. 7 is a schematic block diagram of a voice call apparatus according to an embodiment of the present application;
fig. 8 is a schematic block diagram of another voice call apparatus according to an embodiment of the present application.
Detailed Description
The technical scheme of the application will be described below with reference to the accompanying drawings.
In order to clearly describe the technical solution of the embodiments of the present application, in the embodiments of the present application, the words "first", "second", etc. are used to distinguish the same item or similar items having substantially the same function and effect. It will be appreciated by those of skill in the art that the words "first," "second," and the like do not limit the amount and order of execution, and that the words "first," "second," and the like do not necessarily differ.
It should be noted that, in the embodiments of the present application, words such as "exemplary" or "such as" are used to mean serving as an example, instance, or illustration. Any embodiment or design described herein as "exemplary" or "for example" should not be construed as preferred or advantageous over other embodiments or designs. Rather, the use of words such as "exemplary" or "such as" is intended to present related concepts in a concrete fashion.
In the embodiments of the present application, "at least one" means one or more, and "a plurality" means two or more. "and/or", describes an association relationship of an association object, and indicates that there may be three relationships, for example, a and/or B, and may indicate: a alone, a and B together, and B alone, wherein a, B may be singular or plural. The character "/" generally indicates that the context-dependent object is an "or" relationship. "at least one of" or the like means any combination of these items, including any combination of single item(s) or plural items(s). For example, at least one (one) of a, b, or c may represent: a, b, c, a-b, a-c, b-c, or a-b-c, wherein a, b, c may be single or plural.
The voice call method of the embodiment of the application can be applied to terminal equipment, and the terminal equipment can be a mobile phone or other terminal equipment with a call function.
In order to better understand the embodiments of the present application, a description is given below of terminal devices related to the embodiments of the present application.
Fig. 1 shows a schematic structure of a terminal device 100.
The terminal device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (universal serial bus, USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, a sensor module 180, keys 190, a motor 191, an indicator 192, a camera 193, a display 194, and a subscriber identity module (subscriber identification module, SIM) card interface 195, etc. The sensor module 180 may include a pressure sensor 180A, a gyro sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, and the like.
It is to be understood that the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the terminal device 100. In other embodiments of the application, terminal device 100 may include more or less components than illustrated, or certain components may be combined, or certain components may be split, or different arrangements of components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
The processor 110 may include one or more processing units, such as: the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), a controller, a video codec, a digital signal processor (digital signal processor, DSP), a baseband processor, and/or a neural network processor (neural-network processing unit, NPU), etc. Wherein the different processing units may be separate devices or may be integrated in one or more processors.
The controller can generate operation control signals according to the instruction operation codes and the time sequence signals to finish the control of instruction fetching and instruction execution.
A memory may also be provided in the processor 110 for storing instructions and data. In some embodiments, the memory in the processor 110 is a cache memory. The memory may hold instructions or data that the processor 110 has just used or recycled. If the processor 110 needs to reuse the instruction or data, it can be called directly from the memory. Repeated accesses are avoided and the latency of the processor 110 is reduced, thereby improving the efficiency of the system.
In some embodiments, the processor 110 may include one or more interfaces. The interfaces may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous receiver transmitter (universal asynchronous receiver/transmitter, UART) interface, a mobile industry processor interface (mobile industry processor interface, MIPI), a general-purpose input/output (GPIO) interface, a subscriber identity module (subscriber identity module, SIM) interface, and/or a universal serial bus (universal serial bus, USB) interface, among others.
The I2C interface is a bi-directional synchronous serial bus comprising a serial data line (SDA) and a serial clock line (derail clock line, SCL). In some embodiments, the processor 110 may contain multiple sets of I2C buses. The processor 110 may be coupled to the touch sensor 180K, charger, flash, camera 193, etc., respectively, through different I2C bus interfaces. For example: the processor 110 may be coupled to the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate through an I2C bus interface to implement a touch function of the terminal device 100.
The I2S interface may be used for audio communication. In some embodiments, the processor 110 may contain multiple sets of I2S buses. The processor 110 may be coupled to the audio module 170 via an I2S bus to enable communication between the processor 110 and the audio module 170. In some embodiments, the audio module 170 may transmit an audio signal to the wireless communication module 160 through the I2S interface, to implement a function of answering a call through the bluetooth headset.
PCM interfaces may also be used for audio communication to sample, quantize and encode analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface. In some embodiments, the audio module 170 may also transmit audio signals to the wireless communication module 160 through the PCM interface to implement a function of answering a call through the bluetooth headset. Both the I2S interface and the PCM interface may be used for audio communication.
The UART interface is a universal serial data bus for asynchronous communications. The bus may be a bi-directional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is typically used to connect the processor 110 with the wireless communication module 160. For example: the processor 110 communicates with a bluetooth module in the wireless communication module 160 through a UART interface to implement a bluetooth function. In some embodiments, the audio module 170 may transmit an audio signal to the wireless communication module 160 through a UART interface, to implement a function of playing music through a bluetooth headset.
The MIPI interface may be used to connect the processor 110 to peripheral devices such as a display 194, a camera 193, and the like. The MIPI interfaces include camera serial interfaces (camera serial interface, CSI), display serial interfaces (display serial interface, DSI), and the like. In some embodiments, processor 110 and camera 193 communicate through a CSI interface to implement the photographing function of terminal device 100. The processor 110 and the display 194 communicate via a DSI interface to implement the display function of the terminal device 100.
The GPIO interface may be configured by software. The GPIO interface may be configured as a control signal or as a data signal. In some embodiments, a GPIO interface may be used to connect the processor 110 with the camera 193, the display 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface may also be configured as an I2C interface, an I2S interface, a UART interface, an MIPI interface, etc.
The USB interface 130 is an interface conforming to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, or the like. The USB interface 130 may be used to connect a charger to charge the terminal device 100, or may be used to transfer data between the terminal device 100 and a peripheral device. And can also be used for connecting with a headset, and playing audio through the headset. The interface may also be used to connect other terminal devices, such as AR devices, etc.
It should be understood that the interfacing relationship between the modules illustrated in the embodiment of the present application is only illustrative, and does not constitute a structural limitation of the terminal device 100. In other embodiments of the present application, the terminal device 100 may also use different interfacing manners, or a combination of multiple interfacing manners in the foregoing embodiments.
The charge management module 140 is configured to receive a charge input from a charger. The charger can be a wireless charger or a wired charger. In some wired charging embodiments, the charge management module 140 may receive a charging input of a wired charger through the USB interface 130. In some wireless charging embodiments, the charge management module 140 may receive wireless charging input through a wireless charging coil of the terminal device 100. The charging management module 140 may also supply power to the terminal device through the power management module 141 while charging the battery 142.
The power management module 141 is used for connecting the battery 142, and the charge management module 140 and the processor 110. The power management module 141 receives input from the battery 142 and/or the charge management module 140 to power the processor 110, the internal memory 121, the display 194, the camera 193, the wireless communication module 160, and the like. The power management module 141 may also be configured to monitor battery capacity, battery cycle number, battery health (leakage, impedance) and other parameters. In other embodiments, the power management module 141 may also be provided in the processor 110. In other embodiments, the power management module 141 and the charge management module 140 may be disposed in the same device.
The wireless communication function of the terminal device 100 can be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, a modem processor, a baseband processor, and the like.
The antennas 1 and 2 are used for transmitting and receiving electromagnetic wave signals. Each antenna in the terminal device 100 may be used to cover a single or multiple communication bands. Different antennas may also be multiplexed to improve the utilization of the antennas. For example: the antenna 1 may be multiplexed into a diversity antenna of a wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
The mobile communication module 150 may provide a solution including 2G/3G/4G/5G wireless communication applied to the terminal device 100. The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (low noise amplifier, LNA), etc. The mobile communication module 150 may receive electromagnetic waves from the antenna 1, perform processes such as filtering, amplifying, and the like on the received electromagnetic waves, and transmit the processed electromagnetic waves to the modem processor for demodulation. The mobile communication module 150 can amplify the signal modulated by the modem processor, and convert the signal into electromagnetic waves through the antenna 1 to radiate. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the processor 110. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be provided in the same device as at least some of the modules of the processor 110.
The modem processor may include a modulator and a demodulator. The modulator is used for modulating the low-frequency baseband signal to be transmitted into a medium-high frequency signal. The demodulator is used for demodulating the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then transmits the demodulated low frequency baseband signal to the baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and then transferred to the application processor. The application processor outputs sound signals through an audio device (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or video through the display screen 194. In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be provided in the same device as the mobile communication module 150 or other functional module, independent of the processor 110.
The wireless communication module 160 may provide solutions for wireless communication including wireless local area network (wireless local area networks, WLAN) (e.g., wireless fidelity (wireless fidelity, wi-Fi) network), bluetooth (BT), global navigation satellite system (global navigation satellite system, GNSS), frequency modulation (frequency modulation, FM), near field wireless communication technology (near field communication, NFC), infrared technology (IR), etc., applied to the terminal device 100. The wireless communication module 160 may be one or more devices that integrate at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2, modulates the electromagnetic wave signals, filters the electromagnetic wave signals, and transmits the processed signals to the processor 110. The wireless communication module 160 may also receive a signal to be transmitted from the processor 110, frequency modulate it, amplify it, and convert it to electromagnetic waves for radiation via the antenna 2.
In some embodiments, antenna 1 and mobile communication module 150 of terminal device 100 are coupled, and antenna 2 and wireless communication module 160 are coupled, such that terminal device 100 may communicate with a network and other devices via wireless communication techniques. The wireless communication techniques may include the Global System for Mobile communications (global system for mobile communications, GSM), general packet radio service (general packet radio service, GPRS), code division multiple access (code division multiple access, CDMA), wideband code division multiple access (wideband code division multiple access, WCDMA), time division code division multiple access (time-division code division multiple access, TD-SCDMA), long term evolution (long term evolution, LTE), BT, GNSS, WLAN, NFC, FM, and/or IR techniques, among others. The GNSS may include a global satellite positioning system (global positioning system, GPS), a global navigation satellite system (global navigation satellite system, GLONASS), a beidou satellite navigation system (beidou navigation satellite system, BDS), a quasi zenith satellite system (quasi-zenith satellite system, QZSS) and/or a satellite based augmentation system (satellite based augmentation systems, SBAS).
The terminal device 100 implements display functions through a GPU, a display screen 194, an application processor, and the like. The GPU is a microprocessor for image processing, and is connected to the display 194 and the application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. Processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
The display screen 194 is used to display images, videos, and the like. The display 194 includes a display panel. The display panel may employ a liquid crystal display (liquid crystal display, LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED) or an active-matrix organic light-emitting diode (matrix organic light emitting diode), a flexible light-emitting diode (flex), a mini, a Micro led, a Micro-OLED, a quantum dot light-emitting diode (quantum dot light emitting diodes, QLED), or the like. In some embodiments, the terminal device 100 may include 1 or N display screens 194, N being a positive integer greater than 1.
The terminal device 100 may implement a photographing function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
The ISP is used to process data fed back by the camera 193. For example, when photographing, the shutter is opened, light is transmitted to the camera photosensitive element through the lens, the optical signal is converted into an electric signal, and the camera photosensitive element transmits the electric signal to the ISP for processing and is converted into an image visible to naked eyes. ISP can also optimize the noise, brightness and skin color of the image. The ISP can also optimize parameters such as exposure, color temperature and the like of a shooting scene. In some embodiments, the ISP may be provided in the camera 193.
The camera 193 is used to capture still images or video. The object generates an optical image through the lens and projects the optical image onto the photosensitive element. The photosensitive element may be a charge coupled device (charge coupled device, CCD) or a Complementary Metal Oxide Semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, which is then transferred to the ISP to be converted into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into an image signal in a standard RGB, YUV, or the like format. In some embodiments, the terminal device 100 may include 1 or N cameras 193, N being a positive integer greater than 1.
The digital signal processor is used for processing digital signals, and can process other digital signals besides digital image signals. For example, when the terminal device 100 selects a frequency bin, the digital signal processor is used to fourier transform the frequency bin energy, or the like.
Video codecs are used to compress or decompress digital video. The terminal device 100 may support one or more video codecs. In this way, the terminal device 100 can play or record video in various encoding formats, for example: dynamic picture experts group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
The NPU is a neural-network (NN) computing processor, and can rapidly process input information by referencing a biological neural network structure, for example, referencing a transmission mode between human brain neurons, and can also continuously perform self-learning. Applications such as intelligent awareness of the terminal device 100 may be implemented by the NPU, for example: image recognition, face recognition, speech recognition, text understanding, etc.
The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to realize expansion of the memory capability of the terminal device 100. The external memory card communicates with the processor 110 through an external memory interface 120 to implement data storage functions. For example, files such as music, video, etc. are stored in an external memory card.
The internal memory 121 may be used to store computer executable program code including instructions. The internal memory 121 may include a storage program area and a storage data area. The storage program area may store an application program (such as a sound playing function, an image playing function, etc.) required for at least one function of the operating system, etc. The storage data area may store data (such as audio data, phonebook, etc.) created during use of the terminal device 100, and the like. In addition, the internal memory 121 may include a high-speed random access memory, and may further include a nonvolatile memory such as at least one magnetic disk storage device, a flash memory device, a universal flash memory (universal flash storage, UFS), and the like. The processor 110 performs various functional applications of the terminal device 100 and data processing by executing instructions stored in the internal memory 121 and/or instructions stored in a memory provided in the processor.
The terminal device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playing, recording, etc.
The audio module 170 is used to convert digital audio information into an analog audio signal output and also to convert an analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be disposed in the processor 110, or a portion of the functional modules of the audio module 170 may be disposed in the processor 110.
The speaker 170A, also referred to as a "horn," is used to convert audio electrical signals into sound signals. The terminal device 100 can listen to music or to handsfree talk through the speaker 170A.
A receiver 170B, also referred to as a "earpiece", is used to convert the audio electrical signal into a sound signal. When the terminal device 100 receives a call or voice message, it is possible to receive voice by approaching the receiver 170B to the human ear.
Microphone 170C, also referred to as a "microphone" or "microphone", is used to convert sound signals into electrical signals. When making a call or transmitting voice information, the user can sound near the microphone 170C through the mouth, inputting a sound signal to the microphone 170C. The terminal device 100 may be provided with at least one microphone 170C. In other embodiments, the terminal device 100 may be provided with two microphones 170C, and may implement a noise reduction function in addition to collecting sound signals. In other embodiments, the terminal device 100 may be further provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify the source of sound, implement directional recording functions, etc.
The earphone interface 170D is used to connect a wired earphone. The earphone interface 170D may be a USB interface 130 or a 3.5mm open mobile terminal platform (open mobile terminal platform, OMTP) standard interface, a american cellular telecommunications industry association (cellular telecommunications industry association of the USA, CTIA) standard interface.
The pressure sensor 180A is used to sense a pressure signal, and may convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 180A may be disposed on the display screen 194. The pressure sensor 180A is of various types, such as a resistive pressure sensor, an inductive pressure sensor, a capacitive pressure sensor, and the like. The capacitive pressure sensor may be a capacitive pressure sensor comprising at least two parallel plates with conductive material. The capacitance between the electrodes changes when a force is applied to the pressure sensor 180A. The terminal device 100 determines the intensity of the pressure according to the change of the capacitance. When a touch operation is applied to the display 194, the terminal device 100 detects the intensity of the touch operation according to the pressure sensor 180A. The terminal device 100 may also calculate the position of the touch from the detection signal of the pressure sensor 180A. In some embodiments, touch operations that act on the same touch location, but at different touch operation strengths, may correspond to different operation instructions. For example: and executing an instruction for checking the short message when the touch operation with the touch operation intensity smaller than the first pressure threshold acts on the short message application icon. And executing an instruction for newly creating the short message when the touch operation with the touch operation intensity being greater than or equal to the first pressure threshold acts on the short message application icon.
The gyro sensor 180B may be used to determine a motion gesture of the terminal device 100. In some embodiments, the angular velocity of the terminal device 100 about three axes (i.e., x, y, and z axes) may be determined by the gyro sensor 180B. The gyro sensor 180B may be used for photographing anti-shake. Illustratively, when the shutter is pressed, the gyro sensor 180B detects the angle of the shake of the terminal device 100, calculates the distance to be compensated by the lens module according to the angle, and allows the lens to counteract the shake of the terminal device 100 by the reverse motion, thereby realizing anti-shake. The gyro sensor 180B may also be used for navigating, somatosensory game scenes.
The air pressure sensor 180C is used to measure air pressure. In some embodiments, the terminal device 100 calculates altitude from barometric pressure values measured by the barometric pressure sensor 180C, aiding in positioning and navigation.
The magnetic sensor 180D includes a hall sensor. The terminal device 100 can detect the opening and closing of the flip cover using the magnetic sensor 180D. In some embodiments, when the terminal device 100 is a folder, the terminal device 100 may detect opening and closing of the folder according to the magnetic sensor 180D. And then according to the detected opening and closing state of the leather sheath or the opening and closing state of the flip, the characteristics of automatic unlocking of the flip and the like are set.
The acceleration sensor 180E can detect the magnitude of acceleration of the terminal device 100 in various directions (typically three axes). The magnitude and direction of gravity may be detected when the terminal device 100 is stationary. The method can also be used for identifying the gesture of the terminal equipment, and is applied to the applications such as horizontal and vertical screen switching, pedometers and the like.
A distance sensor 180F for measuring a distance. The terminal device 100 may measure the distance by infrared or laser. In some embodiments, the terminal device 100 may range using the distance sensor 180F to achieve fast focusing.
The proximity light sensor 180G may include, for example, a Light Emitting Diode (LED) and a light detector, such as a photodiode. The light emitting diode may be an infrared light emitting diode. The terminal device 100 emits infrared light outward through the light emitting diode. The terminal device 100 detects infrared reflected light from a nearby object using a photodiode. When sufficient reflected light is detected, it can be determined that there is an object in the vicinity of the terminal device 100. When insufficient reflected light is detected, the terminal device 100 may determine that there is no object in the vicinity of the terminal device 100. The terminal device 100 can detect that the user holds the terminal device 100 close to the ear to talk by using the proximity light sensor 180G, so as to automatically extinguish the screen for the purpose of saving power. The proximity light sensor 180G may also be used in holster mode, pocket mode to automatically unlock and lock the screen.
The ambient light sensor 180L is used to sense ambient light level. The terminal device 100 may adaptively adjust the brightness of the display 194 based on the perceived ambient light level. The ambient light sensor 180L may also be used to automatically adjust white balance when taking a photograph. The ambient light sensor 180L may also cooperate with the proximity light sensor 180G to detect whether the terminal device 100 is in a pocket to prevent false touches.
The fingerprint sensor 180H is used to collect a fingerprint. The terminal device 100 can utilize the collected fingerprint characteristics to realize fingerprint unlocking, access an application lock, fingerprint photographing, fingerprint incoming call answering and the like.
The temperature sensor 180J is for detecting temperature. In some embodiments, the terminal device 100 performs a temperature processing strategy using the temperature detected by the temperature sensor 180J. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold, the terminal device 100 performs a reduction in the performance of a processor located near the temperature sensor 180J in order to reduce power consumption to implement thermal protection. In other embodiments, when the temperature is below another threshold, the terminal device 100 heats the battery 142 to avoid the low temperature causing the terminal device 100 to shut down abnormally. In other embodiments, when the temperature is below a further threshold, the terminal device 100 performs boosting of the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperatures.
The touch sensor 180K, also referred to as a "touch device". The touch sensor 180K may be disposed on the display screen 194, and the touch sensor 180K and the display screen 194 form a touch screen, which is also called a "touch screen". The touch sensor 180K is for detecting a touch operation acting thereon or thereabout. The touch sensor may communicate the detected touch operation to the application processor to determine the touch event type. Visual output related to touch operations may be provided through the display 194. In other embodiments, the touch sensor 180K may also be disposed on the surface of the terminal device 100 at a different location than the display 194.
The bone conduction sensor 180M may acquire a vibration signal. In some embodiments, bone conduction sensor 180M may acquire a vibration signal of a human vocal tract vibrating bone pieces. The bone conduction sensor 180M may also contact the pulse of the human body to receive the blood pressure pulsation signal. In some embodiments, bone conduction sensor 180M may also be provided in a headset, in combination with an osteoinductive headset. The audio module 170 may analyze the voice signal based on the vibration signal of the sound portion vibration bone block obtained by the bone conduction sensor 180M, so as to implement a voice function. The application processor may analyze the heart rate information based on the blood pressure beat signal acquired by the bone conduction sensor 180M, so as to implement a heart rate detection function.
The keys 190 include a power-on key, a volume key, etc. The keys 190 may be mechanical keys. Or may be a touch key. The terminal device 100 may receive key inputs, generating key signal inputs related to user settings and function controls of the terminal device 100.
The motor 191 may generate a vibration cue. The motor 191 may be used for incoming call vibration alerting as well as for touch vibration feedback. For example, touch operations acting on different applications (e.g., photographing, audio playing, etc.) may correspond to different vibration feedback effects. The motor 191 may also correspond to different vibration feedback effects by touching different areas of the display screen 194. Different application scenarios (such as time reminding, receiving information, alarm clock, game, etc.) can also correspond to different vibration feedback effects. The touch vibration feedback effect may also support customization.
The indicator 192 may be an indicator light, may be used to indicate a state of charge, a change in charge, a message indicating a missed call, a notification, etc.
The SIM card interface 195 is used to connect a SIM card. The SIM card may be contacted and separated from the terminal apparatus 100 by being inserted into the SIM card interface 195 or by being withdrawn from the SIM card interface 195. The terminal device 100 may support 1 or N SIM card interfaces, N being a positive integer greater than 1. The SIM card interface 195 may support Nano SIM cards, micro SIM cards, and the like. The same SIM card interface 195 may be used to insert multiple cards simultaneously. The types of the plurality of cards may be the same or different. The SIM card interface 195 may also be compatible with different types of SIM cards. The SIM card interface 195 may also be compatible with external memory cards. The terminal device 100 interacts with the network through the SIM card to realize functions such as call and data communication. In some embodiments, the terminal device 100 employs esims, namely: an embedded SIM card. The eSIM card can be embedded in the terminal device 100 and cannot be separated from the terminal device 100.
The software system of the terminal device 100 may employ a layered architecture, an event driven architecture, a micro-core architecture, a micro-service architecture, or a cloud architecture. In the embodiment of the application, taking an Android system with a layered architecture as an example, a software structure of the terminal device 100 is illustrated.
Fig. 2 is a software configuration block diagram of the terminal device 100 of the embodiment of the present application.
The layered architecture divides the software into several layers, each with distinct roles and branches. The layers communicate with each other through a software interface. In some embodiments, the Android system is divided into four layers, from top to bottom, an application layer, an application framework layer, an Zhuoyun row (Android run) and system libraries, and a kernel layer, respectively.
The application layer may include a series of application packages.
As shown in fig. 2, the application package may include applications for cameras, gallery, calendar, phone calls, maps, navigation, WLAN, bluetooth, music, video, short messages, etc.
The application framework layer provides an application programming interface (application programming interface, API) and programming framework for application programs of the application layer. The application framework layer includes a number of predefined functions.
As shown in FIG. 2, the application framework layer may include a window manager, a content provider, a view system, a telephony manager, a resource manager, a notification manager, and the like.
The window manager is used for managing window programs. The window manager can acquire the size of the display screen, judge whether a status bar exists, lock the screen, intercept the screen and the like.
The content provider is used to store and retrieve data and make such data accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phonebooks, etc.
The view system includes visual controls, such as controls to display text, controls to display pictures, and the like. The view system may be used to build applications. The display interface may be composed of one or more views. For example, a display interface including a text message notification icon may include a view displaying text and a view displaying a picture.
The telephony manager is used to provide the communication functions of the terminal device 100. Such as the management of call status (including on, hung-up, etc.).
The resource manager provides various resources for the application program, such as localization strings, icons, pictures, layout files, video files, and the like.
The notification manager allows the application to display notification information in a status bar, can be used to communicate notification type messages, can automatically disappear after a short dwell, and does not require user interaction. Such as notification manager is used to inform that the download is complete, message alerts, etc. The notification manager may also be a notification in the form of a chart or scroll bar text that appears on the system top status bar, such as a notification of a background running application, or a notification that appears on the screen in the form of a dialog window. For example, a text message is prompted in a status bar, a prompt tone is emitted, the terminal equipment vibrates, and an indicator light blinks.
Android run time includes a core library and virtual machines. Android run time is responsible for scheduling and management of the Android system.
The core library consists of two parts: one part is a function which needs to be called by java language, and the other part is a core library of android.
The application layer and the application framework layer run in a virtual machine. The virtual machine executes java files of the application program layer and the application program framework layer as binary files. The virtual machine is used for executing the functions of object life cycle management, stack management, thread management, security and exception management, garbage collection and the like.
The system library may include a plurality of functional modules. For example: surface manager (surface manager), media library (media library), three-dimensional graphics processing library (e.g., openGL ES), 2D graphics engine (e.g., SGL), etc.
The surface manager is used to manage the display subsystem and provides a fusion of 2D and 3D layers for multiple applications.
Media libraries support a variety of commonly used audio, video format playback and recording, still image files, and the like. The media library may support a variety of audio and video encoding formats, such as MPEG4, h.264, MP3, AAC, AMR, JPG, PNG, etc.
The three-dimensional graphic processing library is used for realizing three-dimensional graphic drawing, image rendering, synthesis, layer processing and the like.
The 2D graphics engine is a drawing engine for 2D drawing.
The kernel layer is a layer between hardware and software. The inner core layer at least comprises a display driver, a camera driver, an audio driver and a sensor driver.
The workflow of the terminal device 100 software and hardware is illustrated below in connection with capturing a photo scene.
When touch sensor 180K receives a touch operation, a corresponding hardware interrupt is issued to the kernel layer. The kernel layer processes the touch operation into the original input event (including information such as touch coordinates, time stamp of touch operation, etc.). The original input event is stored at the kernel layer. The application framework layer acquires an original input event from the kernel layer, and identifies a control corresponding to the input event. Taking the touch operation as a touch click operation, taking a control corresponding to the click operation as an example of a control of a camera application icon, the camera application calls an interface of an application framework layer, starts the camera application, further starts a camera driver by calling a kernel layer, and captures a still image or video by the camera 193.
The voice call is a basic function of the terminal device, and the two devices of the voice call can be called a calling device and a called device, and the calling device and the called device can be any devices with voice call functions, such as mobile phones. Before the voice call is carried out, the calling device and the called device can establish communication connection through a network to complete a signaling interaction flow before the call so as to ensure the follow-up normal call of the calling party and the called party.
The current voice call setup procedure is described in detail below in conjunction with fig. 3. As shown in fig. 3, the voice call setup procedure between the calling device and the called device may include the steps of:
s301, the calling device sends a voice call request to the IP multimedia system (IP multimedia subsystem, IMS), where the voice call request is used to request a voice call with the called device. Correspondingly, the IMS receives the voice call request.
Illustratively, the calling device may create a session with the called device via a session initiation protocol (session initiation protocol, SIP). SIP is an application-layer control protocol formulated by the intel engineering task force (internet engineering task force, LETF) for setting up, modifying and terminating multimedia sessions. SIP is a communication mode based on a request reply like hypertext transfer protocol (hypertext transfer protocol, HTTP), each communication message containing a request for a certain function and corresponding to a reply.
The voice call request may be, for example, an invite (invite) message.
S302, the calling device sends a service request (service request) to the calling base station, wherein the service request is used for requesting the calling base station to provide service for the calling device. Correspondingly, the calling base station receives the service request.
The calling base station is a base station accessed by the calling device, namely the calling device accesses the wireless access network through the calling base station.
S303, the calling device establishes a radio resource control (radio resource control, RRC) connection with the calling base station to establish a radio bearer.
S304, the calling base station sends a service request to the calling EPC, wherein the service request is used for requesting the calling EPC to provide services for the calling base station. Correspondingly, the calling EPC receives the service request.
S305, the calling base station sends a voice call request to the IMS, and correspondingly, the IMS receives the voice call request.
The voice request here may be an invite message, for example.
S306, a security procedure is performed between the calling device and the calling evolved packet core (evolved packet core, EPC), and the security procedure is mainly used for establishing a secure channel between the calling device and the calling EPC.
S307, the RRC reconfiguration process is executed between the calling equipment and the calling base station, and RRC connection reconfiguration is carried out.
S308, the IMS sends a first voice call response to the calling device, wherein the first voice call response is used for responding to the voice call request sent by the calling device, and the voice call request is successfully received by the IMS. Correspondingly, the calling device receives the first voice call response.
Illustratively, in the case where the voice call request is an invite message, the first voice pass response may be, for example, an invite 100 message.
S309, the IMS sends a voice call request to the called EPC, where the voice call request is sent by the calling device and is used to request to make a voice call with the called device. Correspondingly, the called EPC receives the voice call request.
The voice call request may be an invite message, for example.
S310, the called EPC sends a paging (paging) message to the called equipment through the called base station. Correspondingly, the called device receives the paging message.
S311, the called equipment sends a service request to the called base station, wherein the service request is used for requesting the called base station to provide service for the called equipment. Correspondingly, the called base station receives the service request.
The service request may be, for example, a service request message.
S312, the called equipment establishes RRC connection with the called base station to establish a radio bearer.
S313, executing a safety process between the called equipment and the called EPC, wherein the safety process is mainly used for establishing a safety channel between the called equipment and the called EPC.
S314, the RRC reconfiguration process is executed between the called equipment and the called base station, and RRC connection reconfiguration is carried out.
S315, the called EPC sends a voice call request to the called equipment. Correspondingly, the called device receives the voice call request.
S316, the called equipment sends a first voice call response to the IMS based on the voice call request, wherein the first voice call response is used for responding to the voice call request sent by the IMS, and the called equipment successfully receives the voice call request. Correspondingly, the IMS receives the first voice call response.
The first voice call response may be, for example, an invite 100 message.
S317, the called device sends a second voice call response to the IMS, wherein the second voice call response is used for responding to the voice call request, and the second voice call response indicates that the voice call request is in process. Correspondingly, the IMS receives the second voice call response.
Illustratively, the called device, after sending the invite 100 message, may continue to send an invite 183 message, indicating that the voice call request is being processed.
S318, the called device and the called EPC establish an evolved packet system (evolved packet system, EPS) bearer, where the EPS bearer is used for subsequent voice data transmission between the called device and the called EPC.
S319, the calling device and the calling EPC establish an EPS bearer, where the EPS bearer is used for subsequent voice data transmission between the calling device and the calling EPC.
S320, the IMS sends the second voice call response to the calling device. Correspondingly, the calling device receives the second voice call response.
S321, the calling device sends a temporary response acknowledgement (provisional response acknowledgement, PRACK) message to the IMS. Correspondingly, the IMS receives the PRACK message sent from the calling device.
This is a pre-acknowledgement procedure, mainly to prevent session timeout and congestion. The PRACK message is a response to an interim message such as an invite 183 message, for reliability transmission. The calling device sends a PRACK message to the IMS, which is an acknowledgement of the received invite 100 message and invite 183 message.
S322, the IMS sends PRACK information to the called equipment. Correspondingly, the called device receives the PRACK message.
S323, the called equipment sends a PRACK response message to the IMS, wherein the PRACK response message is a response to the PRACK message. Correspondingly, the IMS receives the PRACK response message from the called device.
The PRACK message may be, for example, a PRACK 200 message, which indicates that a PRACK is received.
S324, the IMS sends PRACK response information to the calling device. Correspondingly, the calling device receives the PRACK response message.
S325, the calling device sends an update message to the IMS, where the update message is used to request a media stream change. Correspondingly, the IMS receives an update message from the IMS.
It should be appreciated that the update message described above is one mechanism to update the media stream when a call is not established. The first call or response of the SDP is effected by a temporary response prior to the answer and the media stream is established. After this, the subsequent media stream change before the reply can be realized by an update message.
S326, the IMS sends an update message to the called equipment. Correspondingly, the called device receives an update message from the IMS.
S327, the called equipment sends an update response message to the IMS for responding to the update message. Correspondingly, the IMS receives the update response message.
The update response message may be, for example, an update 200 message.
And S328, the IMS sends an update response message to the calling device. Correspondingly, the calling device receives the update response message.
S329, the called device sends a ringing notification to the IMS, indicating that the called device is ringing. Correspondingly, the IMS receives the ring notification.
The above-mentioned ring notification may be, for example, an invite 180 message.
S330, the IMS sends the ringing notification to the calling equipment. Correspondingly, the calling device receives the ringing notification.
Thus, the calling device can determine that the called device has received the voice call request of the calling device, rings, and waits for the user of the called device to answer.
And S331, if the user of the called equipment answers the call, the called equipment sends an answer notification to the IMS, and the answer notification indicates that the user of the called equipment has answered. Correspondingly, the IMS receives the answer notification.
The answer notification may be, for example, an invite 200 message.
Next, the following two possible cases may occur:
in case 1, S332, the IMS sends the answer notification to the calling device, and the calling device normally receives the answer notification, and continues to execute the following steps:
s333, the calling device sends an answer response to the IMS and adjusts the call to the on state. Correspondingly, the IMS receives the answer response.
The answer response may be, for example, an invite Acknowledgement (ACK) message.
S334, the IMS sends an answer response to the called equipment. Correspondingly, the called device receives the answer response.
And S335, the called device sends real-time transport protocol (real-time transport protocol, RTP) data to the calling device through the network based on the received answer response. Correspondingly, the calling device receives the RTP data, and the calling user can hear the sound of the called user.
S336, the calling device sends RTP data to the called device through the network. Correspondingly, the called device receives the RTP data, and the called user can hear the sound of the calling user.
In case 2, the IMS cannot send the above-mentioned answer notification to the calling device, or the IMS sends the above-mentioned answer notification to the calling device, but the calling device does not receive it, which may be caused by network anomaly.
In this case, since the IMS receives the answer notification in S331, the IMS may execute S334, and the called device receives the answer response through S334, and may execute S335, at this time, the calling user may hear the sound of the called user, but since the above S332 and S333 cannot be executed, normal call connection cannot be established between the calling device and the called device, RTP data of the calling device cannot be transmitted to the called device, and the called user cannot hear the sound of the calling user, that is, a single-pass abnormal phenomenon occurs.
The above case 2 belongs to a scene of abnormal call, after the called party connects to the call invitation of the calling party, the called equipment displays the call connection, but the called party cannot hear the sound of the calling party, the calling party still displays the call, but the calling party can hear the sound of the called party, which causes the phenomenon of voice call single-pass, and has poor experience for users.
In view of this, the present application provides a voice call method and apparatus, in a voice call setup stage, a calling device sends an invite request for a voice call to a called device, after the calling device receives a ring notification sent by the called device, the calling device may determine the abnormal call scene in combination with the existence of a color ring service, the existence type of the color ring service, and all or part of information in sources of downlink RTP packets, and in the case that the abnormal call scene is identified, the calling device sends a confirmation message to the called device, and adjusts a local call state to be on, so that a single-pass fault between two parties of the voice call can be resolved as much as possible, the voice call can be saved as much as possible, so that the voice call can be normally performed, and call experience of two parties of the voice call is improved.
The following describes the voice call method provided by the embodiment of the present application in detail with reference to fig. 4.
Fig. 4 is a schematic flow chart of a voice call method 400 provided in the present application, where a calling device and a called device involved in the method 400 may be terminal devices, a hardware structure of the terminal devices may be shown in fig. 1, and a software architecture may be shown in fig. 2, but embodiments of the present application are not limited thereto. As shown in fig. 4, the method 400 may include the steps of:
s401, a calling device sends an invitation request of voice call to a called device; correspondingly, the called device receives the invite request.
The invite request may be, for example, an invite message.
S402, the called equipment sends a ringing notification of the voice call to the calling equipment based on the invitation request, wherein the ringing notification is used for indicating that the called equipment rings; correspondingly, the calling device receives the ringing notification.
The above-mentioned ring notification may be, for example, an invite 180 message.
S403, under the condition that the preset condition is met, the calling device sends a confirmation message to the called device so as to connect the call; correspondingly, the called device receives the confirmation message.
The acknowledgement message may be, for example, an invite ACK message.
The preset condition may be any one of the following conditions:
the called equipment has no color ring service, and the calling equipment receives a first downlink real-time transmission protocol RTP packet after ringing notification; or alternatively, the process may be performed,
the called equipment has audio color ring service, the calling equipment receives the first downlink RTP packet after the ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the second downlink RTP packet is the downlink RTP packet received by the calling equipment before the ringing notification; or alternatively, the process may be performed,
the called equipment has video color ring service, the calling equipment receives the first downlink RTP packet after ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow.
The called device has no color ring service and can be understood as: whether the called equipment subscribes to the CRBT service. If the called equipment subscribes to the CRBT service, the calling equipment can hear the CRBT when calling the called equipment; if the called equipment does not subscribe to the CRBT service, the calling equipment does not hear the CRBT when calling the called equipment.
S404, the calling device adjusts the local call state to be on. Specifically, the calling device may display the interface in an on state.
It should be understood that the preset conditions are used for the calling device to determine the abnormal call scenario, and if any one of the preset conditions is satisfied, the abnormal call scenario of the single-pass call described in the case 2 of fig. 3 is indicated. Specifically, after the ring notification, the called user puts through the call, and the called device sends voice data to the calling device. If the called equipment has the color ring service, after the ringing notification, before the called user connects the call, the color ring server can send color ring data to the calling equipment, and after the called user connects the call, the called equipment sends voice data to the calling equipment. Therefore, the above-mentioned abnormal call scenario is mainly judged by identifying whether the data packet (i.e. the above-mentioned first downlink RTP packet) received by the calling device after the ring notification is the color ring data or the voice data of the called device, i.e. identifying whether the source of the data packet is the color ring server or the called device. If the source of the data packet is the color ring server, it is possible that the called party does not connect the call, and the called equipment is still in the ringing state, the calling equipment can determine that the call is not the abnormal call scene at present, if the source of the data packet is the called equipment, it is indicated that the called party has connected the call, and the calling equipment can determine that the call is in the abnormal call scene at present.
When the abnormal passing scene is identified, the calling device sends a confirmation message to the called device, and the local call state is adjusted to be on, so that the single-pass fault between the two voice call parties can be solved as much as possible, and the voice call can be saved as much as possible.
The above-described preset conditions are analyzed in detail according to the following, respectively.
And (1) the called equipment does not have the CRBT service, and the calling equipment receives the first downlink RTP packet after the ringing notification.
Because the called equipment has no color ring service, the first downlink RTP packet received by the calling equipment after the ringing notification is the voice data sent by the called equipment, so that in this case, the calling equipment can determine that the called user has been connected to the call and is currently in an abnormal call scene.
And 2, the called equipment has audio color ring service, the calling equipment receives the first downlink RTP packet after the ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the second downlink RTP packet is the downlink RTP packet received by the calling equipment before the ringing notification.
Because the called equipment has the color ring service, and the color ring service is the audio color ring service, after the ringing notification and before the called user is connected with the call, the color ring server can send color ring data to the calling equipment, so the calling equipment needs to judge whether the first downlink RTP packet received after the ringing notification is sent by the color ring server or sent by the called equipment. If the first downlink RTP packet is sent by the called equipment, the called equipment indicates that the called user has been connected with the call, and the calling equipment can determine that the call is in the abnormal call scene currently.
Specifically, the calling device receives a second downlink RTP packet from the ring back tone server before the ring back tone notification, where the second downlink RTP packet carries ring back tone data. The calling device can determine whether the first downlink RTP packet is sent by the color ring server or sent by the called device by judging whether the sources of the first downlink RTP packet and the second downlink RTP packet are the same. If the source of the first downlink RTP packet is the same as the source of the second downlink RTP packet, the first downlink RTP packet is sent by the color ring server, and if the source of the first downlink RTP packet is different from the source of the second downlink RTP packet, the first downlink RTP packet is sent by the called equipment.
And 3, the called equipment has video color ring service, the calling equipment receives the first downlink RTP packet after ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow.
Because the called equipment has the color ring service and the color ring service is the video ring service, if the first downlink RTP packet does not comprise video data stream and the sources of the first downlink RTP packet from the second downlink RTP packet are different, the first downlink RTP packet is sent by the called equipment, which indicates that the called user has been connected with the call, and the calling equipment can determine that the call is in the abnormal call scene currently.
It should be understood that the video data stream may also be simply referred to as a video stream, and its format may be in an h.264 compression format or an h.265 compression format, which is not limited in this embodiment of the present application.
In the voice call method of the embodiment of the application, in the voice call establishment stage, the calling device sends the invitation request of the voice call to the called device, after receiving the ringing notification sent by the called device, the calling device can judge the abnormal call scene by combining all or part of information in the existence condition of the color ring service, the existence type of the color ring service and the source of the downlink RTP packet, and under the condition of recognizing the abnormal passing scene, the calling device sends the confirmation message to the called device and adjusts the local call state to be on, thus solving single-pass faults between two voice call parties as far as possible and saving the voice call as far as possible, so that the voice call can be normally carried out and the call experience of the two voice call parties is improved.
It should be understood that, in the present embodiment, the flow is simplified, and in practical application, the foregoing transmission and reception of the message needs to be forwarded through a network, which is not shown in fig. 4.
As an optional embodiment, before the calling device sends an acknowledgement message to the called device in S403, the method further includes: the calling device sends an update request to the called device, wherein the update request is used for requesting to change the media stream; the calling device receives an update response from the called device; and the calling device judges whether the called device has the color ring service or not based on the updating response.
The update request may be, for example, an update message, and the update response may be, for example, an update 200 message.
As an optional embodiment, the calling device determines, based on the update response, whether the called device has a color ring service, including: the calling device judges whether the updating response comprises a first field and/or a second field, wherein the first field is used for indicating an audio color ring service, and the second field is used for indicating a video color ring service; and if the updating response does not include the first field and the second field, the calling device determines that the called device has no CRBT service.
In the update response, there may be fields for indicating the color ring back tone service, that is, the first field and the second field, where the first field may be, for example, an audio (audio) field, including a=content: g.3gpp.cat, and the second field may be, for example, a video (video) field, including a=content: g.3gpp.cat, and carried in a session description protocol (session description protocol, SDP) of the update response.
If the update response does not include the first field or the second field, the calling device may determine that the called device has no color ring service.
If the update response includes the first field and does not include the second field, the calling device may determine that the called device has the audio color ring service.
If the update response includes the first field and the second field, the calling device may determine that the called device has the video color ring service.
By the method, whether the called equipment has the CRBT service or not and the corresponding CRBT service type when the called equipment has the CRBT service can be simply determined.
As an optional embodiment, before the calling device determines whether the called device has the color ring service based on the update response, the method further includes: the calling device judges whether a successful response of the invitation request is received or not; the calling device judges whether the called device has the color ring back tone service based on the updating response, and the method comprises the following steps: and if the calling equipment does not receive the successful response of the invitation request, the calling equipment judges whether the called equipment has the color ring back tone service or not based on the updating response.
In the embodiment of the application, if the calling device receives the successful response of the invitation request, the calling device can reply the confirmation message to the called device, and the call is connected without subsequent judgment. When the calling device does not receive the successful response of the invitation request, the abnormal call scene may appear, and the calling device may determine the abnormal call scene, and determine whether the called device has the color ring back tone service.
The successful reply to the invite request may be, for example, an invite 200 message.
Thus, the abnormal call scene can be more accurately identified, unnecessary judgment is avoided, and the establishment efficiency of the voice call is improved.
As an alternative embodiment, before the calling device receives the ring notification from the called device, the method further comprises: and under the condition that the called equipment has the color ring service, the calling equipment receives the second downlink RTP packet from the color ring server.
It should be understood that, in the case that the called device has the color ring service, the calling device may receive the second downlink RTP packet sent by the color ring server, that is, the source of the second downlink RTP packet is the color ring server.
As an optional embodiment, before the calling device sends an acknowledgement message to the called device, the method further comprises: under the condition that the called equipment has the color ring service, the calling equipment compares whether the synchronous information source (synchronization source, SSRC) of the first downlink RTP packet and the SSRC of the second downlink RTP packet are the same or not; and if the SSRC of the first downlink RTP packet is different from the SSRC of the second downlink RTP packet, the calling equipment determines that the sources of the first downlink RTP packet and the second downlink RTP packet are different.
In RTP, SSRC is used to represent the source from which the RTP packets are generated, i.e., the source of the RTP packets. For example, the color ring data of the color ring stage is from the color ring server, the voice data of the call stage is from the called device, and the SSRC of the two downlink RTP packets of the color ring stage and the call connection stage in the log are different.
The SSRC of the first downstream RTP packet is the same as the SSRC of the second downstream RTP packet, which indicates that the source of the first downstream RTP packet is the same as the source of the second downstream RTP packet; otherwise, if the SSRC of the first downstream RTP packet is different from the SSRC of the second downstream RTP packet, it means that the source of the first downstream RTP packet is different from the source of the second downstream RTP packet.
Therefore, the embodiment of the application determines that the current abnormal call scene is in through the fact that the calling equipment recognizes that the SSRC of the first downlink RTP packet and the SSRC of the second downlink RTP packet are different, and the calling equipment actively replies the confirmation message to the called equipment, so that the call is attempted to be saved.
The following describes the judgment logic of the calling device in detail in the calling device's angle in the embodiment of the present application.
Fig. 5 shows another voice call method 500, where the calling device and the called device involved in the method 500 may be terminal devices, the hardware structure of the terminal devices may be as shown in fig. 1, and the software architecture may be as shown in fig. 2, but the embodiment of the present application is not limited thereto. Method 500 is performed by a calling device, as shown in fig. 5, the method 500 may include the steps of:
S501, the calling device judges whether an invite 200 message is received after an invite 180 message, wherein the invite 180 message is a ringing message, and the invite 200 message is a successful response message.
If the calling device receives the invite 200 message, the flow is ended.
If the calling device does not receive the invite 200 message, execution continues with S502.
S502, the calling device judges whether the called device has the color ring back tone service. The specific determination method can be referred to the above description, and will not be repeated here.
If the called device has the color ring service, S503 is executed.
If the called device has no color ring service, S504 is executed.
S503, the calling device judges whether the CRBT service of the called device is an audio CRBT service.
If the color ring back tone service of the called device is an audio color ring back tone service, S505 is executed.
If the color ring back tone service of the called device is not the audio color ring back tone service (i.e., the color ring back tone service of the called device is the video color ring back tone service), S506 is executed.
S504, the calling device judges whether a downlink RTP packet is received after the invite 180 message.
If the calling device receives the downstream RTP packet after the invite 180 message, S509 is executed.
If the calling device does not receive the downlink RTP packet after the invite 180 message, the flow is ended.
S505, the calling device judges whether a downlink RTP packet is received after the invite 180 message.
If the calling device receives the downlink RTP packet after the invite 180 message, S507 is executed.
If the calling device does not receive the downlink RTP packet after the invite 180 message, the flow is ended.
S506, the calling device judges whether a downlink RTP packet is received after the invite 180 message.
If the calling device receives the downstream RTP packet after the invite 180 message, S508 is executed.
If the calling device does not receive the downlink RTP packet after the invite 180 message, the flow is ended.
S507, the calling device judges whether the SSRC of the downstream RTP packet received after the invite 180 message and the SSRC of the downstream RTP packet received before the invite 180 message are the same.
If so, S509 is performed.
If the two types of the data are different, the process is ended.
S508, the calling device judges whether the SSRC of the downstream RTP packet received after the invite 180 message is the same as the SSRC of the downstream RTP packet received before the invite 180 message, and whether the downstream RTP packet received after the invite 180 message has a video stream.
If the video streams are identical and there is no video stream, S509 is executed.
If different, and/or video streams exist, the process is ended.
S509, the calling device determines that the downlink RTP packet received after the invite 180 message is sent by the called device.
S510, the calling device sends a response message to the called device to switch on the call.
In this way, the calling device can combine the existence of the color ring service, the existence type of the color ring service and all or part of information in the source of the downlink RTP packet to judge the abnormal call scene, and when the abnormal call scene is identified, the calling device sends a confirmation message to the called device and adjusts the local call state to be on, so that the single-pass fault between the two voice call parties can be solved as much as possible, the voice call can be saved as much as possible, the voice call can be normally performed, and the call experience of the two voice call parties is improved.
Fig. 6 shows another voice call method 600, where the method 600 is optimized for the flow shown in fig. 3, and fig. 3 shows a flow chart of a no-color ring service, and on the basis of fig. 3, if an abnormal call scenario in case 2 described in fig. 3 occurs, a calling device can be identified by the method of the embodiment of the present application, and sends a corresponding answer to attempt to save the call.
S601 to S628 in the method 600 are the same as S301 to S328 in FIG. 3, and are not described here again.
Optionally, in the case that the called device has the color ring service, the method 600 further includes:
s629, the CRBT server sends RTP packet (namely the second downlink RTP packet) to the IMS, wherein the RTP packet carries CRBT data; correspondingly, the IMS receives the RTP packet.
S630, IMS sends the RTP packet to the calling device; correspondingly, the calling device receives the RTP packet.
It should be understood that, if the called device stores the audio ring back tone service, S625 and S628 (i.e. the update message and the update response message) are further executed between the calling device and the IMS before S630, so as to update the media stream to the data of the audio ring back tone service. If the called device stores the video coloring ring back tone service, S625 and S628 (i.e. the update message and the update response message) are performed twice between the calling device and the IMS before S630, and a bearer is also established to update the media stream to the data of the video coloring ring back tone service, which is not shown in fig. 6 above.
S631 to S638 are the same as S329 to S336 in FIG. 3, and are not described here again.
If an anomaly occurs, i.e., S634 and S636 are not transmitted or received, the method 600 further includes:
s639, the calling device sends an answer response to the IMS and adjusts the call to be in an on state; correspondingly, the IMS receives the answer response.
S640, the IMS sends a receiving response to the called equipment; correspondingly, the called device receives the answer response.
It should be understood that the sequence numbers of the above processes do not mean the order of execution, and the execution order of the processes should be determined by the functions and internal logic of the processes, and should not be construed as limiting the implementation process of the embodiments of the present application.
The voice call method according to the embodiment of the present application is described in detail above with reference to fig. 1 to 6, and the voice call apparatus according to the embodiment of the present application will be described in detail below with reference to fig. 7 and 8.
Fig. 7 illustrates a voice call apparatus 700 provided in an embodiment of the present application. The apparatus 700 includes: a transceiver unit 710 and a processing unit 720.
The transceiver unit 710 is configured to: sending an invitation request of voice call to called equipment; receiving a ringing notification from the called equipment, wherein the ringing notification is used for indicating that the called equipment rings; transmitting an acknowledgement message to the called device if any of the following conditions is met: the called equipment has no color ring service, and the device receives a first downlink real-time transmission protocol RTP packet after the ring notification; or the called equipment has audio color ring service, the device receives the first downlink RTP packet after the ringing notification, and the sources of the first downlink RTP packet and the second downlink RTP packet are different, wherein the second downlink RTP packet is the downlink RTP packet received by the device before the ringing notification; or the called equipment has video color ring service, the device receives the first downlink RTP packet after the ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow; the processing unit 720 is configured to: the local call state is adjusted to be on.
Optionally, the transceiver unit 710 is further configured to: sending an update request to the called equipment, wherein the update request is used for requesting to change a media stream; receiving an update response from the called device; the processing unit 720 is further configured to: and judging whether the called equipment has the color ring back tone service or not based on the updating response.
Optionally, the processing unit 720 is further configured to: judging whether the update response comprises a first field and/or a second field, wherein the first field is used for indicating an audio color ring back tone service, and the second field is used for indicating a video color ring back tone service; and if the updating response does not include the first field and the second field, determining that the called equipment has no CRBT service.
Optionally, the processing unit 720 is further configured to: if the update response includes the first field and does not include the second field, determining that the called device has the audio color ring service.
Optionally, the processing unit 720 is further configured to: and if the update response comprises the first field and the second field, determining that the called equipment has the video color ring service.
Optionally, the processing unit 720 is further configured to: judging whether a successful response of the invitation request is received or not; and if the successful response of the invitation request is not received, judging whether the called equipment has the color ring back tone service or not based on the second downlink RTP packet.
Optionally, the transceiver unit 710 is further configured to: and receiving the second downlink RTP packet from the CRBT server under the condition that the called equipment has the CRBT service.
Optionally, the processing unit 720 is further configured to: comparing whether the synchronous source SSRC of the first downlink RTP packet and the SSRC of the second downlink RTP packet are the same or not under the condition that the called equipment has the color ring service; and if the SSRC of the first downlink RTP packet is different from the SSRC of the second downlink RTP packet, determining that the sources of the first downlink RTP packet and the second downlink RTP packet are different.
It should be understood that the apparatus 700 herein is embodied in the form of functional units. The term "unit" herein may refer to an application specific integrated circuit (application specific integrated circuit, ASIC), an electronic circuit, a processor (e.g., a shared, dedicated, or group processor, etc.) and memory that execute one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that support the described functionality. In an alternative example, it will be understood by those skilled in the art that the apparatus 700 may be specifically a calling device in the foregoing embodiment, and the apparatus 700 may be used to perform each flow and/or step corresponding to the calling device in the foregoing method embodiment, which is not described herein for avoiding repetition.
The apparatus 700 of each of the above embodiments has a function of implementing the corresponding steps executed by the calling device in the above method; the functions may be implemented by hardware, or may be implemented by hardware executing corresponding software. The hardware or software includes one or more modules corresponding to the functions described above. For example, the transceiver unit 710 may include a transmitting unit and a receiving unit, where the transmitting unit may be configured to implement respective steps and/or procedures for performing a transmitting action corresponding to the transceiver unit, and the receiving unit may be configured to implement respective steps and/or procedures for performing a receiving action corresponding to the transceiver unit. The transmitting unit may be replaced by a transmitter and the receiving unit may be replaced by a receiver, performing the transceiving operations and the associated processing operations in the respective method embodiments, respectively.
In an embodiment of the present application, the apparatus 700 in fig. 7 may also be a chip or a chip system, for example: system on chip (SoC). Correspondingly, the transceiver unit 710 may be a transceiver circuit of the chip, which is not limited herein.
Fig. 8 illustrates another voice call apparatus 170 provided in an embodiment of the present application. The apparatus 170 includes a processor 810, a transceiver 820, and a memory 830. Wherein the processor 810, the transceiver 820 and the memory 830 are in communication with each other through an internal connection path, the memory 830 is configured to store instructions, and the processor 810 is configured to execute the instructions stored in the memory 830 to control the transceiver 820 to transmit and/or receive signals.
Wherein the transceiver 820 is configured to: sending an invitation request of voice call to called equipment; receiving a ringing notification from the called equipment, wherein the ringing notification is used for indicating that the called equipment rings; transmitting an acknowledgement message to the called device if any of the following conditions is met: the called equipment has no color ring service, and the device receives a first downlink real-time transmission protocol RTP packet after the ring notification; or the called equipment has audio color ring service, the device receives the first downlink RTP packet after the ringing notification, and the sources of the first downlink RTP packet and the second downlink RTP packet are different, wherein the second downlink RTP packet is the downlink RTP packet received by the device before the ringing notification; or the called equipment has video color ring service, the device receives the first downlink RTP packet after the ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow; the processor 810 is configured to: the local call state is adjusted to be on.
It should be understood that the apparatus 170 may be specifically configured as the calling device in the foregoing embodiment, and may be configured to perform the steps and/or flows corresponding to the calling device in the foregoing method embodiment. The memory 830 may optionally include read-only memory and random access memory, and provide instructions and data to the processor. A portion of the memory may also include non-volatile random access memory. For example, the memory may also store information of the device type. The processor 810 may be configured to execute instructions stored in a memory, and when the processor 810 executes instructions stored in the memory, the processor 810 is configured to perform the steps and/or processes of the method embodiments described above with respect to the calling device. The transceiver 820 may include a transmitter that may be used to implement various steps and/or processes for performing transmit actions corresponding to the transceiver and a receiver that may be used to implement various steps and/or processes for performing receive actions corresponding to the transceiver.
It should be appreciated that in embodiments of the present application, the processor of the apparatus described above may be a central processing unit (central processing unit, CPU), which may also be other general purpose processors, digital Signal Processors (DSPs), application Specific Integrated Circuits (ASICs), field Programmable Gate Arrays (FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in a processor or by instructions in the form of software. The steps of the method disclosed in connection with the embodiments of the present application may be embodied directly in a hardware processor for execution, or in a combination of hardware and software elements in the processor for execution. The software elements may be located in a random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers, etc. as well known in the art. The storage medium is located in a memory, and the processor executes instructions in the memory to perform the steps of the method described above in conjunction with its hardware. To avoid repetition, a detailed description is not provided herein.
Those of ordinary skill in the art will appreciate that the various illustrative modules and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the solution. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
It will be clearly understood by those skilled in the art that, for convenience and brevity of description, specific working procedures of the above-described system, apparatus and module may refer to corresponding procedures in the foregoing method embodiments, which are not repeated herein.
In the several embodiments provided by the present application, it should be understood that the disclosed systems, devices, and methods may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, and for example, the division of the modules is merely a logical function division, and there may be additional divisions when actually implemented, for example, multiple modules or components may be combined or integrated into another system, or some features may be omitted or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or modules, which may be in electrical, mechanical, or other forms.
The modules described as separate components may or may not be physically separate, and components shown as modules may or may not be physical modules, i.e., may be located in one place, or may be distributed over a plurality of network modules. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional module in each embodiment of the present application may be integrated into one processing module, or each module may exist alone physically, or two or more modules may be integrated into one module.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art or in a part of the technical solution, in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a read-only memory (ROM), a random access memory (random access memory, RAM), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
The foregoing is merely a specific implementation of the present application, but the scope of the embodiments of the present application is not limited thereto, and any person skilled in the art may easily think about changes or substitutions within the technical scope of the embodiments of the present application, and all changes and substitutions are included in the scope of the embodiments of the present application. Therefore, the protection scope of the embodiments of the present application shall be subject to the protection scope of the claims.

Claims (14)

1. A voice call method, comprising:
the calling device sends an invitation request of voice call to the called device;
the calling device receives a ringing notification from the called device, wherein the ringing notification is used for indicating that the called device rings;
the calling device sends a confirmation message to the called device and adjusts the local call state to be on when any one of the following conditions is met:
the called equipment does not have color ring service, and the calling equipment receives a first downlink real-time transmission protocol RTP packet after the ringing notification; or alternatively, the process may be performed,
the called equipment has audio color ring service, the calling equipment receives the first downlink RTP packet after the ringing notification, sources of the first downlink RTP packet and a second downlink RTP packet are different, and the second downlink RTP packet is the downlink RTP packet received by the calling equipment before the ringing notification; or alternatively, the process may be performed,
The called equipment has video color ring service, the calling equipment receives the first downlink RTP packet after the ringing notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow;
before the calling device sends the confirmation message to the called device, the method further comprises:
the calling device sends an update request to the called device, wherein the update request is used for requesting media stream change;
the calling device receives an update response from the called device;
the calling device judges whether a successful response of the invitation request is received or not;
and if the calling equipment does not receive the successful response of the invitation request, the calling equipment judges whether the called equipment has the color ring back tone service or not based on the updating response.
2. The method of claim 1, wherein the calling device determining whether the called device has a color ring service based on the update response comprises:
the calling device judges whether the updating response comprises a first field and/or a second field, wherein the first field is used for indicating an audio color ring service, and the second field is used for indicating a video color ring service;
And if the updating response does not include the first field and the second field, the calling device determines that the called device has no CRBT service.
3. The method according to claim 2, wherein the method further comprises:
if the update response includes the first field and does not include the second field, the calling device determines that the called device has the audio color ring service.
4. The method according to claim 2, wherein the method further comprises:
if the update response includes the first field and the second field, the calling device determines that the called device has the video color ring service.
5. The method of claim 1, wherein prior to the calling device receiving a ring notification from the called device, the method further comprises:
and under the condition that the called equipment has the color ring service, the calling equipment receives the second downlink RTP packet from the color ring server.
6. The method according to any one of claims 1 to 5, wherein before the calling device sends an acknowledgement message to the called device, the method further comprises:
Under the condition that the called equipment has the color ring service, the calling equipment compares whether the synchronous source SSRC of the first downlink RTP packet and the SSRC of the second downlink RTP packet are the same or not;
and if the SSRC of the first downlink RTP packet is different from the SSRC of the second downlink RTP packet, the calling equipment determines that the sources of the first downlink RTP packet and the second downlink RTP packet are different.
7. A voice call apparatus, comprising:
a receiving and transmitting unit, configured to send an invite request for voice call to a called device; receiving a ringing notification from the called equipment, wherein the ringing notification is used for indicating that the called equipment rings; transmitting an acknowledgement message to the called device if any of the following conditions is met:
the called equipment has no color ring service, and the device receives a first downlink real-time transmission protocol RTP packet after the ring notification; or alternatively, the process may be performed,
the called equipment has audio color ring service, the device receives the first downlink RTP packet after the ringing notification, and the sources of the first downlink RTP packet and the second downlink RTP packet are different, wherein the second downlink RTP packet is the downlink RTP packet received by the device before the ringing notification; or alternatively, the process may be performed,
The called equipment has video color ring service, the device receives the first downlink RTP packet after the ring notification, the sources of the first downlink RTP packet and the second downlink RTP packet are different, and the first downlink RTP packet does not comprise video data flow;
the processing unit is used for adjusting the local call state to be on;
the transceiver unit is further configured to:
sending an update request to the called equipment, wherein the update request is used for requesting to change a media stream;
receiving an update response from the called device;
the processing unit is further configured to:
judging whether a successful response of the invitation request is received or not;
and if the successful response of the invitation request is not received, judging whether the called equipment has the color ring back tone service or not based on the updating response.
8. The apparatus of claim 7, wherein the processing unit is further configured to:
judging whether the update response comprises a first field and/or a second field, wherein the first field is used for indicating an audio color ring back tone service, and the second field is used for indicating a video color ring back tone service;
and if the updating response does not include the first field and the second field, determining that the called equipment has no CRBT service.
9. The apparatus of claim 8, wherein the processing unit is further configured to:
if the update response includes the first field and does not include the second field, determining that the called device has the audio color ring service.
10. The apparatus of claim 8, wherein the processing unit is further configured to:
and if the update response comprises the first field and the second field, determining that the called equipment has the video color ring service.
11. The apparatus of claim 7, wherein the transceiver unit is further configured to:
and receiving the second downlink RTP packet from the CRBT server under the condition that the called equipment has the CRBT service.
12. The apparatus according to any one of claims 7 to 11, wherein the processing unit is further configured to:
comparing whether the synchronous source SSRC of the first downlink RTP packet and the SSRC of the second downlink RTP packet are the same or not under the condition that the called equipment has the color ring service;
and if the SSRC of the first downlink RTP packet is different from the SSRC of the second downlink RTP packet, determining that the sources of the first downlink RTP packet and the second downlink RTP packet are different.
13. A voice call apparatus, comprising: a processor coupled to a memory for storing a computer program which, when invoked by the processor, causes the apparatus to perform the method of any one of claims 1 to 6.
14. A computer readable storage medium storing a computer program comprising instructions for implementing the method of any one of claims 1 to 6.
CN202310671244.1A 2023-06-08 2023-06-08 Voice communication method and device Active CN116405594B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310671244.1A CN116405594B (en) 2023-06-08 2023-06-08 Voice communication method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310671244.1A CN116405594B (en) 2023-06-08 2023-06-08 Voice communication method and device

Publications (2)

Publication Number Publication Date
CN116405594A CN116405594A (en) 2023-07-07
CN116405594B true CN116405594B (en) 2023-10-20

Family

ID=87010899

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310671244.1A Active CN116405594B (en) 2023-06-08 2023-06-08 Voice communication method and device

Country Status (1)

Country Link
CN (1) CN116405594B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018018892A1 (en) * 2016-07-27 2018-02-01 华为技术有限公司 Method for playing video ring back tone and calling user equipment
CN111212192A (en) * 2018-11-21 2020-05-29 中国电信股份有限公司 Method and device for playing voice when IMS fixed-line user dials VOLTE user
CN112866488A (en) * 2021-01-06 2021-05-28 中国联合网络通信集团有限公司 Video color ring back tone playing method, server and terminal
CN114095887A (en) * 2022-01-20 2022-02-25 荣耀终端有限公司 Call abnormity prompting method, communication system, electronic equipment and medium
CN115243206A (en) * 2022-09-21 2022-10-25 荣耀终端有限公司 Voice call establishing method and electronic equipment
WO2023039884A1 (en) * 2021-09-18 2023-03-23 华为技术有限公司 Call method and communication apparatus

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018018892A1 (en) * 2016-07-27 2018-02-01 华为技术有限公司 Method for playing video ring back tone and calling user equipment
CN111212192A (en) * 2018-11-21 2020-05-29 中国电信股份有限公司 Method and device for playing voice when IMS fixed-line user dials VOLTE user
CN112866488A (en) * 2021-01-06 2021-05-28 中国联合网络通信集团有限公司 Video color ring back tone playing method, server and terminal
WO2023039884A1 (en) * 2021-09-18 2023-03-23 华为技术有限公司 Call method and communication apparatus
CN114095887A (en) * 2022-01-20 2022-02-25 荣耀终端有限公司 Call abnormity prompting method, communication system, electronic equipment and medium
CN115243206A (en) * 2022-09-21 2022-10-25 荣耀终端有限公司 Voice call establishing method and electronic equipment

Also Published As

Publication number Publication date
CN116405594A (en) 2023-07-07

Similar Documents

Publication Publication Date Title
CN110138937B (en) Call method, device and system
CN113905179B (en) Method for switching cameras by terminal and terminal
US11703960B2 (en) Air mouse mode implementation method and related device
US11483359B2 (en) Voice call method, electronic device, and system
US20230247093A1 (en) Data Synchronization Method, Electronic Device, and Computer-Readable Storage Medium
CN114079893B (en) Bluetooth communication method, terminal device and computer readable storage medium
CN113448382B (en) Multi-screen display electronic device and multi-screen display method of electronic device
CN114390337A (en) Screen projection method and system and electronic equipment
EP3902328A1 (en) Network searching method, and terminal device
US20230125956A1 (en) Wireless Communication System and Method
CN113973398B (en) Wireless network connection method, electronic equipment and chip system
CN111343326A (en) Method and related device for acquiring test log
JP2022515446A (en) Connection establishment method and terminal device
CN114338913B (en) Fault diagnosis method, electronic device and readable storage medium
US11792631B2 (en) Emergency call method and user terminal
CN116389884B (en) Thumbnail display method and terminal equipment
WO2022152167A1 (en) Network selection method and device
CN114554012B (en) Incoming call answering method, electronic equipment and storage medium
CN116405594B (en) Voice communication method and device
CN114489876A (en) Text input method, electronic equipment and system
CN111801931B (en) Method for switching on and hanging up telephone when call occurs SRVCC switch
CN116708674B (en) Communication method and electronic equipment
CN114500725B (en) Target content transmission method, master device, slave device, and storage medium
CN116232959B (en) Network quality detection method and device
CN116321265B (en) Network quality evaluation method, electronic device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant