WO2019144292A1 - Video answering communication method and terminal - Google Patents

Video answering communication method and terminal

Info

Publication number
WO2019144292A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
user terminal
format
message
terminal
Prior art date
Application number
PCT/CN2018/073874
Other languages
English (en)
Chinese (zh)
Inventor
李毅泉
Original Assignee
深圳前海达闼云端智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳前海达闼云端智能科技有限公司 filed Critical 深圳前海达闼云端智能科技有限公司
Priority to CN201880000051.0A priority Critical patent/CN108353145A/zh
Priority to PCT/CN2018/073874 priority patent/WO2019144292A1/fr
Publication of WO2019144292A1 publication Critical patent/WO2019144292A1/fr

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/141 Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147 Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/10 Architectures or entities
    • H04L65/1016 IP multimedia subsystem [IMS]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/75 Media network packet handling
    • H04L65/762 Media network packet handling at the source
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403 User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72469 User interfaces specially adapted for cordless or mobile telephones for operating the device by selecting functions from two or more displayed items, e.g. menus or icons
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/76 Television signal recording

Definitions

  • the present disclosure relates to the field of communications, and in particular, to a video answering communication method and terminal.
  • the answering machine generally uses telephone voice to answer the phone.
  • the user pre-records a voice message into the answering machine. After a call is received from the calling party, if no one answers within the predetermined time, the recorded voice message is sent to the calling party. After hearing the message, the caller says what he or she wants to say. The caller's voice is sent to the answering machine for storage, and the user of the answering machine can listen to it afterwards.
  • the existing answering machines are for audio answering, and video answering is not possible.
  • a video answering communication method including:
  • the video data and the audio data of the second format are transmitted to the user terminal.
  • the method further includes:
  • the stored video from the user terminal is played back in response to a playback trigger.
  • before transmitting the video data and the audio data of the second format to the user terminal, the method further comprises: encapsulating the video data and audio data of the second format in an IMS protocol stack and a telecommunication network protocol stack.
  • the telecommunications network protocol stack includes at least one of a Long Term Evolution (LTE) protocol stack and a WiFi protocol stack.
  • the receiving the video communication request from the user terminal includes:
  • the communication request is judged to be a video communication request.
  • after receiving the communication request from the user terminal, the method further includes:
  • the message voice is sent to the user terminal.
  • the message video of the first format is selected from a plurality of message videos based on a user terminal that transmitted the video communication request.
  • the message video of the first format is selected from a plurality of message videos based on a current time.
  • a video answering communication method including:
  • Video data and audio data of a second format are transmitted to the user terminal.
  • the method further includes:
  • the stored video from the user terminal is played back in response to a playback trigger.
  • after receiving the response from the user terminal, the method further includes:
  • a confirmation response that supports video playback is received from the user terminal.
  • after sending to the user terminal an inquiry request as to whether video playback is supported, the method further includes:
  • before transmitting the video data and the audio data of the second format to the user terminal, the method further includes: encapsulating the video data and audio data of the second format in an IP Multimedia Subsystem (IMS) protocol stack and a telecommunication network protocol stack.
  • the telecommunications network protocol stack includes at least one of a Long Term Evolution (LTE) protocol stack and a WiFi protocol stack.
  • the message video of the first format is selected from a plurality of message videos based on a phone number of the user terminal.
  • the message video of the first format is selected from a plurality of message videos based on a current time.
  • a video answering communication terminal including:
  • a receiver configured to receive a video communication request from the user terminal
  • a processor configured to convert the message video of the first format into video data and audio data of the negotiated second format in response to the call not being answered within a predetermined time after the video communication request is received;
  • a transmitter configured to send video data and audio data in a second format to the user terminal.
  • the receiver is further configured to receive video from the user terminal after transmitting video data and audio data in a transmission format to the user terminal.
  • the terminal further includes a memory configured to store the video from the user terminal, and a display configured to play back the stored video from the user terminal in response to a playback trigger.
  • the processor is further configured to: encapsulate the video data and audio data of the second format in an IMS protocol stack and a telecommunication network protocol stack before the video data and audio data of the second format are transmitted to the user terminal.
  • the telecommunications network protocol stack includes at least one of an LTE protocol stack and a WiFi protocol stack.
  • the receiver is further configured to receive a communication request from the user terminal prior to receiving the video communication request from the user terminal; the processor configured to determine that the communication request is a video communication request.
  • the processor is further configured to, after a communication request is received from the user terminal, determine that the communication request is an audio call request; the transmitter is further configured to send the message voice to the user terminal in response to the call not being answered within the predetermined time after the audio call request is received.
  • the message video of the first format is selected from a plurality of message videos based on a user terminal transmitting the video communication request.
  • the message video of the first format is selected from a plurality of message videos based on a current time.
  • a video answering communication terminal including:
  • a transmitter configured to send a call request to the user terminal
  • a receiver configured to receive a response from the user terminal
  • a processor configured to convert the message video of the first format into the video data and audio data of the negotiated second format
  • the transmitter is further configured to transmit video data and audio data in a second format to the user terminal.
  • the receiver is further configured to receive video from the user terminal after transmitting video data and audio data in a transmission format to the user terminal.
  • the terminal further includes a memory configured to store the video from the user terminal, and a display configured to play back the stored video from the user terminal in response to a playback trigger.
  • the transmitter is further configured to, after the response is received from the user terminal, send to the user terminal an inquiry request as to whether video playback is supported.
  • the receiver is further configured to receive from the user terminal a confirmation response indicating that video playback is supported.
  • the receiver is further configured to receive from the user terminal, after the inquiry request as to whether video playback is supported has been sent, a response indicating that video playback is not supported.
  • the transmitter is further configured to send a message voice to the user terminal.
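The active-call behaviour described above (call request, response, inquiry about video playback support, then message video or message voice) can be sketched as a small decision function. This is only an illustrative sketch: the names `Payload` and `choose_payload` are hypothetical, and returning `None` when no response is received is an assumption that the text does not state.

```python
from enum import Enum
from typing import Optional


class Payload(Enum):
    """What the answering terminal sends after an active call (illustrative names)."""
    MESSAGE_VIDEO = "message_video"
    MESSAGE_VOICE = "message_voice"


def choose_payload(response_received: bool, supports_video: bool) -> Optional[Payload]:
    """Pick the message to send once the user terminal has responded.

    Assumption: if no response to the call request arrives, nothing is sent.
    """
    if not response_received:
        return None
    # The inquiry result decides between the message video and the message voice.
    if supports_video:
        return Payload.MESSAGE_VIDEO
    return Payload.MESSAGE_VOICE
```

In this sketch the message video would still have to be converted to the negotiated second format before transmission; only the choice between video and voice is modelled.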
  • the processor is further configured to: encapsulate the video data and audio data of the second format in an IMS protocol stack and a telecommunication network protocol stack before the video data and audio data of the second format are transmitted to the user terminal.
  • the telecommunications network protocol stack includes at least one of a Long Term Evolution (LTE) protocol stack and a WiFi protocol stack.
  • the message video of the first format is selected from a plurality of message videos based on a phone number of the user terminal.
  • the message video of the first format is selected from a plurality of message videos based on a current time.
  • the key obstacle to video answering in the prior art is that, even if a video message can be recorded on the answering machine, the format of the recorded video message differs from the video format that can be transmitted in video communication, which makes it difficult to transmit over the video network.
  • in the embodiments of the present disclosure, the pre-recorded message video of the answering machine user is converted into video data and audio data of the negotiated second format and sent to the calling user, and the video data and audio data of the second format are transmittable in video communication. In this way, video answering can be achieved.
  • FIG. 1 is a schematic structural diagram of an application of a video answering communication method according to an embodiment of the present disclosure.
  • FIG. 2 is a schematic diagram of a general structure of a video answering communication terminal according to an embodiment of the present disclosure.
  • FIG. 3 is an interaction flowchart of a video answering communication method for passive call answering according to an embodiment of the present disclosure.
  • FIG. 4 is an interaction flowchart of a video answering communication method for passive call answering according to an embodiment of the present disclosure.
  • FIG. 5 is an interaction flowchart of a video answering communication method for passive call answering according to an embodiment of the present disclosure.
  • FIG. 6 is an interaction flowchart of a video answering communication method for an active call answering according to an embodiment of the present disclosure.
  • FIG. 7 is an interaction flowchart of a video answering communication method for an active call answering according to an embodiment of the present disclosure.
  • FIG. 8 is an interaction flowchart of a video answering communication method for an active call answering according to an embodiment of the present disclosure.
  • embodiments of the present disclosure may be implemented as a system, apparatus, device, method, or computer program product. Accordingly, the present disclosure may be embodied in the form of full hardware, complete software (including firmware, resident software, microcode, etc.), or a combination of hardware and software.
  • FIG. 1 is a schematic structural diagram of an application of a video answering communication method according to an embodiment of the present disclosure.
  • the framework includes a user terminal 1, a video answering communication terminal 2, an IMS network 4, and a telecommunications network 3.
  • the IMS network 4 includes an IMS server 41.
  • the telecommunications network 3 comprises a telecommunications network server 31.
  • the video data is sent to the IMS network 4 through the telecommunication network 3, and then transmitted to the opposite end of the video communication through the telecommunication network 3.
  • IMS IP Multimedia Subsystem
  • a telecommunications network refers to a traditional telecommunications network.
  • the traditional telecommunication network can support traditional services such as voice, but it cannot handle a mixed service that needs to transmit multiple media such as video, data, and voice.
  • the telecommunications network 3 needs to be connected to the IMS network 4. It may include an LTE network or a WiFi network or the like.
  • LTE (long term evolution) is a long-term evolution of the UMTS (Universal Mobile Telecommunications System) technology standard developed by the 3GPP (The 3rd Generation Partnership Project). It was officially approved and launched at the 3GPP Toronto Conference in December 2004.
  • WiFi is a technology that allows an electronic device to connect to a WLAN (wireless local area network).
  • the LTE network is connected to the IMS network, and the resulting network is a ViLTE network, in which video is transmitted to the IMS network through the LTE network.
  • the WiFi network is connected to the IMS network, and the obtained network is a video over WiFi network, wherein the video is sent to the IMS network through the WiFi network.
  • the video answering communication terminal 2 is a terminal having a video answering function. It can be a telephone answering machine, or it can be a smartphone with a telephone answering function, a computer terminal, and so on.
  • the user terminal 1 is a terminal on the opposite side of the video answering communication terminal 2. It can have an answering function or it can have no answering function. It can be a mobile phone, a fixed terminal, or a computer terminal with a calling function, and the like.
  • FIG. 2 is a schematic diagram of a general structure of a video answering communication terminal according to an embodiment of the present disclosure.
  • the video answering communication terminal includes: a memory 220, an input unit 230, a receiver 240, a transmitter 282, a processor 250, a power source 260, a display 270, an audio circuit 290, and the like.
  • the structure of the video answering communication terminal shown in FIG. 2 does not constitute a limitation of the video answering communication terminal; the terminal may include more or fewer components than those illustrated, combine some components, or use a different arrangement of components.
  • the receiver 240 and the transmitter 282 can be used for receiving and transmitting signals, respectively, during a call.
  • receiver 240 receives the communication signals for processing by processor 250; transmitter 282 transmits the processed communication signals from processor 250.
  • the internal structures of receiver 240 and transmitter 282 include, but are not limited to, an antenna or a receiving, transmitting unit, at least one amplifier, a coupler, a low noise amplifier (LNA), a duplexer, and the like.
  • Receiver 240 and transmitter 282 communicate with the network and other devices via wireless communication.
  • the wireless communication may use any communication standard or protocol, including but not limited to global system of mobile communication (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), long term evolution (LTE), e-mail, short messaging service (SMS), and the like.
  • the memory 220 can be used to store software programs and modules that execute various functional applications and data processing of the video answering communication terminal by running software programs and modules stored in the memory 220.
  • the memory 220 may mainly include a storage program area and a storage data area. The storage program area may store an operating system and the applications (APPs) required for at least one function, such as a sound playing function or an image playing function. The storage data area may store data (such as audio data, image data, a phone book, etc.) created through use of the video answering communication terminal.
  • memory 220 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
  • the input unit 230 can be configured to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the video answering communication terminal.
  • the input unit 230 can include a keypad 231 and other input devices 232.
  • the keypad 231 is a button area on the video answering communication terminal in which the user enters the telephone number to be called.
  • Input unit 230 may also include other input devices 232.
  • other input devices 232 may include, but are not limited to, one or more of a touch screen, function keys (such as volume control buttons, power switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • Display 270 can be used to display information entered by the user or information provided to the user as well as various menus of the video answering communication terminal.
  • the display 270 can include a display panel 271.
  • the display panel 271 can be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like.
  • the touch screen may cover the display panel 271.
  • when the touch screen detects a touch operation on or near it, the operation is transmitted to the processor 250 to determine the type of touch event, and the processor 250 then provides a corresponding visual output on the display panel 271 according to the type of touch event.
  • although the touch screen and the display panel 271 may function as two separate components to implement the input and output functions of the video answering communication terminal, in some embodiments the touch screen can be integrated with the display panel 271 to implement the input and output functions of the video answering communication terminal.
  • the audio circuit 290, the speaker 291, and the microphone 292 can provide an audio interface between the user and the video answering communication terminal.
  • on the one hand, the audio circuit 290 can convert received audio data into an electrical signal and transmit it to the speaker 291, which converts it into a sound signal for output; on the other hand, the microphone 292 converts the collected sound signal into an electrical signal, which the audio circuit 290 receives and converts into audio data, and the audio data is output to the memory 220 for further processing.
  • the processor 250 is a control center of the video answering communication terminal, and connects various parts of the entire video answering communication terminal by using various interfaces and lines. It performs the various functions of the video answering communication terminal and processes data by running or executing software programs and/or modules stored in the memory 220 and calling data stored in the memory 220, thereby monitoring the video answering communication terminal as a whole.
  • the processor 250 may include one or more processing units; optionally, the processor 250 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, the user interface, applications, and the like.
  • the modem processor primarily handles wireless communications. It will be appreciated that the above described modem processor may also not be integrated into the processor 250.
  • the video answering communication terminal further includes a power source 260 (such as a battery) for supplying power to various components.
  • the power source can be logically connected to the processor 250 through a power management system, so that functions such as charging, discharging, and power consumption management are handled through the power management system.
  • a video answering communication method for passive call answering includes steps S101-S110.
  • in step S101, the answering communication terminal 2 records the user's message video.
  • in response to the user pressing the first button, recording of the user's message video begins; in response to the user pressing the second button, recording of the user's message video is stopped.
  • the first button is, for example, a "recording" button on the answering communication terminal 2
  • the second button is, for example, a "stop recording" button on the answering communication terminal 2.
  • the first button and the second button may be the same button.
  • the first button and the second button are both "recording" buttons.
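When the first and second buttons are the same "recording" button, each press simply toggles the recording state. The following minimal sketch illustrates that behaviour; the class name `MessageRecorder` and its `events` log are hypothetical and stand in for whatever camera and storage calls a real terminal would make.

```python
class MessageRecorder:
    """Toggle-style recorder: one button both starts and stops recording."""

    def __init__(self) -> None:
        self.recording = False
        self.events = []  # illustrative log of "start" / "stop" actions

    def press_record_button(self) -> bool:
        """Flip the recording state and return the new state."""
        self.recording = not self.recording
        self.events.append("start" if self.recording else "stop")
        return self.recording
```

The first press starts recording of the message video and the second press stops it.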
  • a message video is a video that is recorded in advance to be sent directly to the calling party during a phone call.
  • step S101 may further include recording a plurality of videos corresponding to different incoming users, or recording a plurality of videos corresponding to different time segments.
  • Step S101 is not required by the present disclosure and need not be performed. In some cases, step S101 may be omitted.
  • the system can preset the message video without the user's recording.
  • in step S102, the video answering communication terminal receives a video communication request from the user terminal.
  • a video communication request is a request for video communication.
  • the user of the user terminal presses the "Video Call" button on the user terminal to send a video communication request.
  • in step S103, the video answering communication terminal makes an incoming call prompt.
  • Incoming call alerts can be made in a variety of ways. One way is ringing, one way is vibration, and the other way is lighting.
  • step S103 is not required by the present disclosure and need not be performed.
  • in some cases, step S103 may be omitted. For example, when the user does not want to be disturbed, the terminal can be set to give no incoming call prompt, so that all calls received during this time are instead handled by video answering.
  • in step S104, in response to the call not being answered within a predetermined time after the video communication request is received, the video answering communication terminal converts the message video of the first format into video data and audio data of the negotiated second format.
  • the predetermined time may be set by the system at the time of shipment from the factory, or may be set by the user of the video answering communication terminal.
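The condition that triggers step S104 can be written as a simple predicate. The sketch below is illustrative only: the function name and parameters are hypothetical, and treating "exactly at the predetermined time" as expired is an assumption.

```python
def should_send_message_video(
    seconds_since_request: float,
    answered: bool,
    predetermined_time: float,
) -> bool:
    """Return True when the call is still unanswered after the predetermined time.

    `predetermined_time` may be a factory default or a user setting, as the
    text describes; how it is stored is outside this sketch.
    """
    return (not answered) and seconds_since_request >= predetermined_time
```

A terminal could evaluate this predicate on a timer while the incoming call prompt is active.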
  • the negotiated second format refers to a format, negotiated in advance with the user terminal, that can be transmitted in the video transmission architecture shown in FIG. 1. For example, for the video data in the video, the H.264 format can be used; for the audio data in the video, a format acceptable to the user terminal side can be adopted.
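The text does not say how the second format is negotiated. One common approach, shown here purely as an assumption, is to pick the first format in the answering terminal's preference list that the user terminal also supports. The function name `negotiate_format` and the example format lists are hypothetical.

```python
from typing import Iterable, Optional, Sequence


def negotiate_format(
    local_preferences: Sequence[str],
    remote_supported: Iterable[str],
) -> Optional[str]:
    """Return the first locally preferred format the remote side supports.

    Returns None when the two sides share no format.
    """
    remote = set(remote_supported)
    for fmt in local_preferences:
        if fmt in remote:
            return fmt
    return None
```

For example, `negotiate_format(["H.264", "H.263"], ["H.263", "H.264"])` yields `"H.264"`, matching the H.264 example given in the text.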
  • in step S105, the video answering communication terminal encapsulates the video data and the audio data of the second format in an IP Multimedia Subsystem (IMS) protocol stack and a telecommunication network protocol stack.
  • the video answering communication terminal 2 sends the video data and audio data of the second format to its local IMS protocol stack, which encapsulates them into a format recognizable by the IMS server 41. The video data and audio data of the second format encapsulated by the IMS protocol stack are then passed to the telecommunication network protocol stack, which encapsulates them into a format recognizable by the telecommunication network server 31.
  • the telecommunications network protocol stack includes at least one of a Long Term Evolution (LTE) protocol stack and a WiFi protocol stack.
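The two-layer encapsulation in step S105, and the reverse decapsulation on the user terminal side, can be illustrated with a toy example. The byte prefixes below are placeholders invented for this sketch; they are not real IMS, LTE, or WiFi framing, and the function names are hypothetical.

```python
IMS_HEADER = b"IMS|"   # placeholder for the IMS protocol stack's framing
NET_HEADER = b"NET|"   # placeholder for the telecom network stack's framing


def encapsulate(media: bytes) -> bytes:
    """Wrap media first in the IMS layer, then in the telecom network layer."""
    ims_packet = IMS_HEADER + media
    return NET_HEADER + ims_packet


def decapsulate(frame: bytes) -> bytes:
    """Unwrap in the reverse order: telecom network layer first, then IMS."""
    if not frame.startswith(NET_HEADER):
        raise ValueError("missing telecom network layer")
    ims_packet = frame[len(NET_HEADER):]
    if not ims_packet.startswith(IMS_HEADER):
        raise ValueError("missing IMS layer")
    return ims_packet[len(IMS_HEADER):]
```

The point of the sketch is the ordering: the outermost layer is the one the telecommunication network server handles, and the inner layer is the one the IMS server handles.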
  • in step S106, the video answering communication terminal transmits the video data and the audio data of the second format to the user terminal.
  • the video answering communication terminal 2 transmits the video data and the audio data of the second format to the telecommunication network server 31 on the side of the video answering communication terminal 2. Then, the telecommunication network server 31 on the side of the video answering communication terminal 2 transmits video data and audio data of the second format to the IMS server 41. Then, the IMS server 41 transmits video data and audio data of the second format to the telecommunication network server 31 on the user terminal 1 side.
  • the pre-recorded message video of the answering machine user is converted into video data and audio data of the negotiated second format and sent to the calling user, and the video data and audio data of the second format are transmittable in video communication. In this way, the user of the user terminal can see the video recorded in advance by the user on the answering machine side, thereby achieving video answering.
  • steps S107-S110 are subsequent processes of recording a video for the user on the user terminal side.
  • in step S107, the user terminal 1 displays the message video.
  • after receiving the video data and the audio data of the second format, the user terminal 1 decapsulates them through its own telecommunication network protocol stack and IP Multimedia Subsystem (IMS) protocol stack, and displays the message video based on the decapsulated video data and audio data.
  • in step S108, the answering communication terminal 2 receives the video from the user terminal 1.
  • the user of the user terminal 1 triggers the first function of the call interface, for example by pressing the "video message" soft key on the touch interface, and then records a video. After recording, the user triggers the second function on the interface, such as the "end video message" soft key of the touch interface, and the recording ends. After the recording ends, the user terminal 1 transmits the recorded video to the answering communication terminal 2 through the telecommunication network server 31 and the IMS server 41.
  • in step S109, the answering communication terminal 2 stores the video from the user terminal.
  • in step S110, the answering communication terminal 2 plays back the stored video from the user terminal in response to the playback trigger.
  • the playback trigger is an activating action for playback of the stored video from the user terminal.
  • the playback trigger is accomplished by pressing a particular button on the answering communication terminal 2.
  • the playback trigger is achieved by identifying the voice of the user of the answering communication terminal 2.
  • the embodiment of the present disclosure not only automatically transmits the message video pre-recorded on the answering communication terminal 2 side to the user terminal but also, through steps S108-S110, lets the user on the user terminal side leave a video message for the user of the answering communication terminal 2.
  • the types of supported communications differ depending on the type of user terminal with which the answering communication terminal communicates. For example, some user terminals may only support voice communication and do not support video communication. Converting the message video of the first format into the negotiated second-format video data and audio data every time a communication request is received from a user terminal, without distinguishing the request type, increases the network load and the response burden on the user terminal. Therefore, in the following embodiments, the communication request of the user terminal is first determined, and different strategies are adopted according to the type of communication supported by the user terminal, thereby improving network throughput and transmission efficiency.
  • step S102 includes: S1022, receiving a communication request from a user terminal; and S1023, determining that the communication request is a video communication request.
  • determining whether the communication request is a video communication request is accomplished by determining whether a particular field in the communication request contains a first identification.
  • the first identification is an identification indicating that the communication request is a video communication request.
  • the other parts of Fig. 4 are substantially identical to Fig. 3.
  • if it is determined after step S1022 that the communication request is an audio call request, as shown in Fig. 5, the process proceeds to step S1022'.
  • determining whether the communication request is an audio call request is accomplished by determining whether a particular field in the communication request contains a second identity.
  • the second identification is an identification indicating that the communication request is an audio call request.
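The check on a particular field for the first or second identification can be sketched as follows. The field name `media_type` and the identification values are assumptions made for illustration; the text only says that a particular field carries an identification.

```python
VIDEO_IDENTIFICATION = "video"  # hypothetical value of the first identification
AUDIO_IDENTIFICATION = "audio"  # hypothetical value of the second identification


def classify_request(request: dict) -> str:
    """Classify a communication request by inspecting one field.

    Returns "video_call", "audio_call", or "unknown" when neither
    identification is present.
    """
    marker = request.get("media_type")  # hypothetical field name
    if marker == VIDEO_IDENTIFICATION:
        return "video_call"
    if marker == AUDIO_IDENTIFICATION:
        return "audio_call"
    return "unknown"
```

A video call would then follow the flow of Fig. 4, and an audio call the flow of Fig. 5.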
  • in step S103, the video answering communication terminal makes an incoming call prompt.
  • the method of incoming call alert is as described above.
  • in step S106', in response to the call not being answered within the predetermined time after the audio call request is received, the message voice is transmitted to the user terminal.
  • in step S107', the user terminal 1 plays the message voice.
  • in step S108', the answering communication terminal 2 receives the voice from the user terminal 1.
  • the user of the user terminal 1 triggers the third function of the call interface, for example by pressing the "voice message" soft key on the touch interface, and then makes a voice recording. After recording, the user triggers the fourth function on the interface, such as the "End Voice Message" soft key on the touch interface, and the recording ends. After the recording ends, the user terminal 1 transmits the recorded voice to the answering communication terminal 2 through the telecommunication network server 31 and the IMS server 41.
  • in step S109', the answering communication terminal 2 stores the voice from the user terminal.
  • in step S110', the answering communication terminal 2 plays back the stored voice from the user terminal in response to the playback trigger.
  • the playback trigger is an activating action for playback of the stored voice from the user terminal.
  • the playback trigger is accomplished by pressing a particular button on the answering communication terminal 2.
  • the playback trigger is achieved by identifying the voice of the user of the answering communication terminal 2.
  • a plurality of message videos corresponding to different user terminals are stored in the answering communication terminal 2.
  • The message video of the first format is selected from the plurality of message videos based on the user terminal that transmits the video communication request. Specifically, after the video communication request is received from the user terminal, the number of the user terminal that issued the request is determined from the request, and the message video corresponding to that user terminal number is selected from the plurality of message videos.
  • For example, a message video specifically tailored to the user's father and mother is displayed to them, while a generic message video is displayed in response to video communication requests from other user terminals.
  • This embodiment improves the customization of the answering service.
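The per-caller selection described above amounts to a lookup keyed by the user terminal number, with a generic video as the fallback. The following is a minimal sketch under that reading; the function name, the phone numbers, and the file names are all hypothetical.

```python
from typing import Dict


def select_message_video(caller_number: str,
                         videos_by_number: Dict[str, str],
                         default_video: str) -> str:
    """Return the message video stored for this caller, else the generic one."""
    return videos_by_number.get(caller_number, default_video)


# Hypothetical data: tailored videos for two known callers.
tailored = {
    "13800000001": "message_for_dad.mp4",
    "13800000002": "message_for_mom.mp4",
}
chosen = select_message_video("13800000001", tailored, "generic_message.mp4")
```

Any caller whose number is not in the mapping receives the generic message video.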
  • a plurality of message videos corresponding to different times are stored in the answering communication terminal 2.
  • the message video of the first format is selected from a plurality of message videos based on the time when the video communication request is received. Specifically, after receiving the video communication request from the user terminal, the current time is determined, and the message video corresponding to the current time is selected from the plurality of message videos according to the current time.
  • For example, the user may wish to show callers who send a video communication request during business hours a video explaining that the user is at work and unable to answer, and to show callers who send a video communication request during leisure time a video explaining that the user is away from the phone or is not carrying it.
  • This embodiment improves the time customization of the answering service.
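Time-based selection can be sketched as a schedule of half-open time windows, each mapped to a message video. The window boundaries and file names below are assumptions chosen for illustration; the source does not define what counts as business hours.

```python
from datetime import time
from typing import List, Tuple

# Each entry: (start inclusive, end exclusive, video name). Hypothetical values.
Schedule = List[Tuple[time, time, str]]


def select_video_for_time(now: time, schedule: Schedule, fallback: str) -> str:
    """Return the video whose window contains `now`, else the fallback."""
    for start, end, video in schedule:
        if start <= now < end:
            return video
    return fallback


business_hours: Schedule = [(time(9, 0), time(18, 0), "at_work.mp4")]
during_work = select_video_for_time(time(10, 30), business_hours, "away_from_phone.mp4")
after_work = select_video_for_time(time(20, 0), business_hours, "away_from_phone.mp4")
```

Windows that cross midnight are not handled in this sketch; they would need to be split into two entries.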
  • A video answering communication method for active call answering is also provided, including steps S201-S213.
  • Active call answering refers to an answering mode in which the user terminal has not placed a call and the recorded message video is instead actively pushed to it. It can be widely used for advertising pushes and the like.
  • In step S201, the answering communication terminal 2 records a message video.
  • The recorded message video may be an advertisement video.
  • In step S202, the answering communication terminal 2 transmits a call request to the user terminal 1.
  • An address book containing a plurality of user terminal phone numbers may be pre-stored in the answering communication terminal 2.
  • Step S202 can be implemented by sequentially calling the telephone numbers in the address book. In this way, broad coverage of user terminals, for example for advertising, can be achieved.
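Calling the address book sequentially can be sketched as a simple loop. The `place_call` callback below is a hypothetical stand-in for whatever actually sends the call request and reports whether the user terminal responded; it is not an API named in the source.

```python
from typing import Callable, Iterable, List


def call_address_book(numbers: Iterable[str],
                      place_call: Callable[[str], bool]) -> List[str]:
    """Call each number in order and return those that responded."""
    answered: List[str] = []
    for number in numbers:
        # place_call is assumed to return True when the user terminal responds.
        if place_call(number):
            answered.append(number)
    return answered


# Hypothetical address book and a fake dialer that only one number answers.
address_book = ["10001", "10002", "10003"]
responders = call_address_book(address_book, lambda n: n == "10002")
```

The numbers that responded are the ones for which the later conversion and transmission steps would be carried out.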
  • In step S203, the user terminal 1 gives an incoming call alert.
  • In step S204, the answering communication terminal 2 receives the response of the user terminal.
  • In step S207, the answering communication terminal 2 converts the message video of the first format into the video data and audio data of the negotiated second format.
  • The method of converting the message video of the first format into the video data and audio data of the negotiated second format is similar to step S104.
  • In step S208, the answering communication terminal 2 encapsulates the video data and the audio data of the second format in an IP Multimedia Subsystem (IMS) protocol stack and a telecommunication network protocol stack.
  • Step S208 is similar to step S105.
  • In step S209, the answering communication terminal 2 transmits the video data and audio data of the second format to the user terminal 1.
  • Step S209 is similar to step S106.
  • The message video pre-recorded by the user on the answering communication terminal side is converted into the video data and audio data of the negotiated second format and sent to the user terminal, and the video data and audio data of the second format can be transmitted in video communication. In this way, a pre-recorded video can be actively pushed to a large number of users.
  • In step S210, the user terminal 1 displays the message video.
  • This step is similar to step S107.
  • In step S211, the answering communication terminal 2 receives the video from the user terminal 1.
  • This step is similar to step S108.
  • In step S212, the answering communication terminal 2 stores the video from the user terminal.
  • This step is similar to step S109.
  • In step S213, the answering communication terminal 2 plays back the stored video from the user terminal in response to the playback trigger.
  • This step is similar to step S110.
  • the embodiment of the present disclosure can not only actively push the pre-recorded message video on the answering communication terminal 2 side to a large number of user terminals, but also can perform convenient video interaction with a large number of user terminals through the above steps S210-S213.
  • the user terminal's response to the message video (including actions, sounds) can be monitored. Especially in the context of advertising, it can help analyze the effectiveness of advertising and monitor the response of the audience in a timely manner.
  • the following embodiments can adopt different strategies according to different communication types supported by the user terminal, thereby improving network throughput.
  • The method further includes: S205, the answering communication terminal 2 sends the user terminal an inquiry request asking whether it supports video playback; and S206, the answering communication terminal 2 receives from the user terminal a confirmation response indicating that video playback is supported.
  • Steps S207-S213 are performed only when the answering communication terminal 2 receives an acknowledgment response from the user terminal indicating that video playback is supported.
  • When the answering communication terminal 2 receives, in step S206', an acknowledgment response from the user terminal indicating that video playback is not supported, the answering communication terminal 2 transmits a message voice to the user terminal in step S209'.
  • In step S210', the user terminal 1 plays the message voice.
  • This step is similar to step S107'.
  • In step S211', the answering communication terminal 2 receives the voice from the user terminal 1.
  • This step is similar to step S108'.
  • In step S212', the answering communication terminal 2 stores the voice from the user terminal.
  • This step is similar to step S109'.
  • In step S213', the answering communication terminal 2 plays back the stored voice from the user terminal in response to the playback trigger.
  • This step is similar to step S110'.
  • This step is similar to step S110'.
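The branch between the video steps and the primed voice steps can be sketched as below. The two callbacks are hypothetical placeholders for the sending logic, and the boolean stands for the outcome of the capability inquiry; none of these names come from the source.

```python
from typing import Callable


def push_message(supports_video: bool,
                 send_video: Callable[[], None],
                 send_voice: Callable[[], None]) -> str:
    """Send the message video or the message voice based on the inquiry result.

    supports_video stands for the acknowledgment received in S206 or S206'.
    """
    if supports_video:
        send_video()   # corresponds to the video branch, steps S207-S209
        return "video"
    send_voice()       # corresponds to the voice branch, step S209'
    return "voice"


# Hypothetical usage: record which branch was taken.
log = []
branch = push_message(False, lambda: log.append("video sent"), lambda: log.append("voice sent"))
```

Skipping the conversion for terminals that cannot play video is what avoids the unnecessary network load mentioned in the text.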
  • a plurality of message videos corresponding to different user terminals are stored in the answering communication terminal 2.
  • the message video of the first format is selected from a plurality of message videos based on a user terminal that transmits the video communication request.
  • The number of the user terminal is determined, and the message video corresponding to that user terminal number is selected from the plurality of message videos.
  • For example, if the user terminal number is a number from a Cantonese-speaking region, the Cantonese video advertisement is pushed to the user terminal; if the user terminal number is a Shanghai number, the Shanghainese video advertisement is pushed to the user terminal.
  • This embodiment improves the targeting of the video push service.
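Regional targeting of this kind could be sketched as a longest-prefix match on the user terminal number. The prefixes and file names below are illustrative assumptions only; the source does not say how the region of a number is determined.

```python
from typing import Dict


def select_video_by_prefix(number: str,
                           videos_by_prefix: Dict[str, str],
                           default_video: str) -> str:
    """Pick the video registered for the longest prefix that matches the number."""
    best = ""
    for prefix in videos_by_prefix:
        if number.startswith(prefix) and len(prefix) > len(best):
            best = prefix
    return videos_by_prefix[best] if best else default_video


# Hypothetical prefix table mapping area codes to dialect-specific adverts.
regional_ads = {
    "020": "advert_cantonese.mp4",
    "021": "advert_shanghainese.mp4",
}
ad = select_video_by_prefix("02112345678", regional_ads, "advert_mandarin.mp4")
```

Numbers that match no registered prefix fall back to the default advertisement.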
  • a plurality of message videos corresponding to different times are stored in the answering communication terminal 2.
  • the message video of the first format is selected from a plurality of message videos based on the time at which the response of the user terminal is received. Specifically, after receiving the response from the user terminal, the current time is determined, and the message video corresponding to the current time is selected from the plurality of message videos according to the current time.
  • For example, a restaurant may serve lunch and dinner. Based on the current time, it is decided whether to push the lunch video or the dinner video.
  • This embodiment improves the time customization of the answering service.
  • A video answering communication terminal for passive call answering is also provided, including:
  • the receiver 240 is configured to receive a video communication request from the user terminal;
  • the processor 250 is configured to convert the message video of the first format into the video data and audio data of the negotiated second format in response to the call not being answered within a predetermined time after the video communication request is received;
  • the transmitter 282 is configured to send the video data and the audio data of the second format to the user terminal.
  • The receiver 240 is further configured to receive video from the user terminal after the video data and audio data of the second format are transmitted to the user terminal.
  • The terminal further includes a memory 220 configured to store the video from the user terminal, and a display 270 configured to play back the stored video from the user terminal in response to a playback trigger.
  • The processor 250 is further configured to encapsulate the video data and audio data of the second format with an IMS protocol stack and a telecommunication network protocol stack before the video data and audio data of the second format are transmitted to the user terminal.
  • The telecommunications network protocol stack includes at least one of an LTE protocol stack and a WiFi protocol stack.
  • The receiver 240 is further configured to receive a communication request from the user terminal prior to receiving the video communication request from the user terminal; the processor 250 is configured to determine that the communication request is a video communication request.
  • The processor 250 is further configured to determine, after a communication request is received from the user terminal, that the communication request is an audio call request; the transmitter 282 is further configured to send the message voice to the user terminal in response to the call not being answered within the predetermined time after the audio call request is received.
  • the message video of the first format is selected from a plurality of message videos based on a user terminal transmitting the video communication request.
  • the message video of the first format is selected from a plurality of message videos based on a current time.
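The decision the passive terminal makes once a request is pending can be sketched as a small function: keep alerting until the predetermined time elapses, then send the message video or the message voice depending on the request type. The string labels and the numeric timeout are assumptions for illustration; the source does not give a value for the predetermined time.

```python
from typing import Optional


def choose_passive_action(request_type: str,
                          seconds_unanswered: float,
                          timeout_seconds: float) -> Optional[str]:
    """Return the answering action to take, or None to keep alerting.

    request_type is assumed to be "video" or "audio".
    """
    if seconds_unanswered < timeout_seconds:
        return None  # the predetermined time has not yet elapsed
    if request_type == "video":
        return "send_message_video"   # convert, encapsulate, then transmit
    if request_type == "audio":
        return "send_message_voice"
    return None  # unknown request types are ignored in this sketch


# Hypothetical 20-second predetermined time.
action = choose_passive_action("video", seconds_unanswered=25.0, timeout_seconds=20.0)
```

In terms of the components listed above, the processor would make this decision and the transmitter would carry out the resulting action.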
  • A video answering communication terminal for active call answering is also provided, including:
  • the transmitter 282 is configured to send a call request to the user terminal;
  • the receiver 240 is configured to receive a response of the user terminal;
  • the processor 250 is configured to convert the message video of the first format into the video data and the audio data of the negotiated second format; and
  • the transmitter 282 is further configured to transmit the video data and audio data of the second format to the user terminal.
  • The receiver 240 is further configured to receive video from the user terminal after the video data and audio data of the second format are transmitted to the user terminal.
  • The terminal further includes a memory 220 configured to store the video from the user terminal, and a display 270 configured to play back the stored video from the user terminal in response to a playback trigger.
  • The transmitter 282 is further configured to, after a response is received from the user terminal, send the user terminal an inquiry request asking whether it supports video playback.
  • The receiver 240 is also configured to receive an acknowledgment response from the user terminal indicating that video playback is supported.
  • The receiver 240 is further configured to receive, after the inquiry request is transmitted to the user terminal, an acknowledgment response from the user terminal indicating that video playback is not supported.
  • The transmitter 282 is also configured to send a message voice to the user terminal.
  • The processor 250 is further configured to encapsulate the video data and audio data of the second format with an IMS protocol stack and a telecommunications network protocol stack before the video data and audio data of the second format are transmitted to the user terminal.
  • The telecommunications network protocol stack includes at least one of a Long Term Evolution (LTE) protocol stack and a Wireless Fidelity (WiFi) protocol stack.
  • the message video of the first format is selected from a plurality of message videos based on a phone number of the user terminal.
  • the message video of the first format is selected from a plurality of message videos based on a current time.
  • Although several modules or units of equipment for action execution are mentioned in the detailed description above, such division is not mandatory. Indeed, in accordance with embodiments of the present disclosure, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of one of the modules or units described above may be further divided into multiple modules or units.
  • The components displayed as modules or units may or may not be physical units, i.e., they may be located in one place or may be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the present disclosure. Those of ordinary skill in the art can understand and implement this without any creative effort.
  • a computer readable storage medium having stored thereon a computer program, the program being executable by the processor to implement the steps of the method of any one of the above embodiments.
  • The computer readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, or an optical data storage device.
  • A computing device is also provided, which can be applied to a server paired with a plurality of clients and includes a processor and a memory for storing executable instructions of the processor.
  • the processor is configured to cause the server to perform the steps of the method in any one of the above embodiments via execution of the executable instructions.
  • the computing device may be a mobile terminal such as a mobile phone or a tablet computer, or may be a terminal device such as a desktop computer or a server. This is not limited in this embodiment.
  • The example embodiments described herein may be implemented by software or by software in combination with necessary hardware. Therefore, the technical solution according to an embodiment of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a USB flash drive, a mobile hard disk, etc.) or on a network, and which includes a number of instructions that cause a computing device (which may be a personal computer, server, touch terminal, or network device, etc.) to perform the above-described methods in accordance with embodiments of the present disclosure.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Human Computer Interaction (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

The invention relates to a video answering communication method, comprising the steps of: receiving a video communication request from a user terminal; in response to the video communication request not being answered within a predefined period after the video communication request has been received, converting a message video in a first format into video and audio data in a negotiated second format; and sending the video and audio data in the second format to the user terminal.
PCT/CN2018/073874 2018-01-23 2018-01-23 Video answering communication method and terminal WO2019144292A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201880000051.0A CN108353145A (zh) 2018-01-23 2018-01-23 Video answering communication method and terminal
PCT/CN2018/073874 WO2019144292A1 (fr) 2018-01-23 2018-01-23 Video answering communication method and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/073874 WO2019144292A1 (fr) 2018-01-23 2018-01-23 Video answering communication method and terminal

Publications (1)

Publication Number Publication Date
WO2019144292A1 true WO2019144292A1 (fr) 2019-08-01

Family

ID=62955690

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/073874 WO2019144292A1 (fr) 2018-01-23 2018-01-23 Video answering communication method and terminal

Country Status (2)

Country Link
CN (1) CN108353145A (fr)
WO (1) WO2019144292A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11157738B2 (en) * 2018-11-30 2021-10-26 Cloudminds Robotics Co., Ltd. Audio-visual perception system and apparatus and robot system
CN110719426B (zh) * 2019-10-10 2022-09-09 腾讯科技(深圳)有限公司 Video message method, related apparatus, and storage medium
CN114598684A (zh) * 2020-12-03 2022-06-07 中兴通讯股份有限公司 Video greeting playback method, system, server, and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6438216B1 (en) * 1998-07-30 2002-08-20 Siemens Information And Communication Networks, Inc. Nonintrusive call notification method and system using content-specific information
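CN101132437A (zh) * 2007-09-24 2008-02-27 宇龙计算机通信科技(深圳)有限公司 Multi-standby terminal and automatic answering method and apparatus thereof
CN102256216A (zh) * 2010-05-17 2011-11-23 华为软件技术有限公司 Network communication message-leaving method, mailbox service platform, and system
CN104092840A (zh) * 2013-04-01 2014-10-08 中国移动通信集团江苏有限公司 Message-leaving method, platform, and system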

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
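CN1921518B (zh) * 2006-08-21 2010-05-12 华为技术有限公司 Recording device, storage server, recording system and method, and playback system and method
CN101552826B (zh) * 2009-05-04 2012-01-11 中兴通讯股份有限公司 Method and apparatus for automatic answering in videophone service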
US9661135B1 (en) * 2014-11-25 2017-05-23 Shoretel, Inc. Adaptive communication handling

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6438216B1 (en) * 1998-07-30 2002-08-20 Siemens Information And Communication Networks, Inc. Nonintrusive call notification method and system using content-specific information
CN101132437A (zh) * 2007-09-24 2008-02-27 宇龙计算机通信科技(深圳)有限公司 一种多待机终端及其自动应答方法及装置
CN102256216A (zh) * 2010-05-17 2011-11-23 华为软件技术有限公司 网络通信留言方法、邮箱业务平台及系统
CN104092840A (zh) * 2013-04-01 2014-10-08 中国移动通信集团江苏有限公司 一种留言方法、平台及系统

Also Published As

Publication number Publication date
CN108353145A (zh) 2018-07-31

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18901851

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 23.11.2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18901851

Country of ref document: EP

Kind code of ref document: A1