WO2021098708A1 - Call method and terminal device - Google Patents
- Publication number
- WO2021098708A1 (PCT/CN2020/129662, CN2020129662W)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- call
- terminal
- voice
- call terminal
- Prior art date
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/72406—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by software upgrading or downloading
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72433—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72403—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
- H04M1/7243—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
- H04M1/72436—User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for text messaging, e.g. short messaging services [SMS] or e-mails
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72448—User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Definitions
- the embodiments of the present invention relate to the field of communication technology, and in particular, to a call method and terminal equipment.
- When a user is on a call, high noise in the call environment degrades call quality. The usual remedies today are to raise the call volume or to move somewhere quieter. However, if the user still cannot hear the other party clearly at maximum volume, or cannot move to a quieter place, these remedies cannot guarantee call quality.
- the embodiments of the present invention provide a call method and terminal equipment to solve the existing problem of poor call quality due to high noise in the call environment.
- the present invention is implemented as follows:
- an embodiment of the present invention provides a call method, which is applied to a first call terminal, where the first call terminal includes a voice assistant; the method includes:
- the target call terminal is the first call terminal
- the first information is text information
- the second information is voice information
- the target call terminal is a second call terminal in a call with the first call terminal.
- the first information is voice information
- the second information is text information.
- an embodiment of the present invention also provides a terminal device, the terminal device is a first call terminal, the terminal device includes a voice assistant; the terminal device includes:
- An obtaining module configured to obtain the first information of the target call terminal through the voice assistant when the voice assistant is turned on;
- a conversion module configured to convert the first information into second information through the voice assistant
- An output module for outputting the second information
- the target call terminal is the first call terminal
- the first information is text information
- the second information is voice information
- the target call terminal is a second call terminal in a call with the first call terminal.
- the first information is voice information
- the second information is text information.
- an embodiment of the present invention also provides a terminal device.
- the terminal device is a first call terminal.
- the terminal device includes a processor, a memory, and a computer program that is stored in the memory and can run on the processor; when executed by the processor, the computer program implements the steps of the call method described above.
- an embodiment of the present invention also provides a computer-readable storage medium having a computer program stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the call method described above are implemented.
- the first call terminal can use the voice assistant to convert the voice information of the second call terminal into text information, so that the user of the first call terminal can obtain the content expressed by the user of the second call terminal by viewing the text information.
- the first call terminal can receive text information input by the user and convert it into voice information through the voice assistant, so as to transmit the voice information to the second call terminal. In this way, even when it is inconvenient for the user of the first call terminal to speak, that user can still talk with the user of the second call terminal. It can be seen that the embodiments of the present invention can improve call quality.
- FIG. 1 is one of the flowcharts of a call method provided by an embodiment of the present invention.
- FIG. 2 is a schematic diagram of a call page according to an embodiment of the present invention.
- FIG. 3 is the second flowchart of the call method provided by the embodiment of the present invention.
- FIG. 4 is the third flowchart of the call method provided by the embodiment of the present invention.
- FIG. 5 is one of the structural diagrams of a terminal device provided by an embodiment of the present invention.
- FIG. 6 is the second structural diagram of a terminal device provided by an embodiment of the present invention.
- the terms "first", "second", etc. in the present invention are used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence.
- the terms “including” and “having” and any variations of them are intended to cover non-exclusive inclusions.
- a process, method, system, product, or device that includes a series of steps or units is not necessarily limited to the steps or units that are clearly listed, but may include other steps or units that are not clearly listed or that are inherent to the process, method, product, or device.
- the use of "and/or" in the present invention means at least one of the connected objects. For example, "A and/or B and/or C" covers seven cases: A alone; B alone; C alone; both A and B; both B and C; both A and C; and all of A, B, and C.
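As an illustration of the counting above, the seven cases are simply the non-empty combinations of the three connected objects. The following is a minimal sketch; the function name `and_or_cases` is made up for this example and does not come from the patent text.

```python
from itertools import combinations


def and_or_cases(items):
    """Return every non-empty combination of the connected objects.

    For n objects there are 2**n - 1 such combinations.
    """
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


cases = and_or_cases(["A", "B", "C"])
for case in cases:
    print(" and ".join(case))
print(len(cases), "cases")
```

With three objects the list contains 2^3 - 1 = 7 entries, which matches the enumeration in the text.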
- the call method in the embodiment of the present invention can be applied to the first call terminal.
- the first call terminal can establish a call with other second call terminals.
- the expression form of the call may include a telephone call, a voice call, a video call, and so on.
- the call terminal can be a mobile phone, a tablet computer (Tablet Personal Computer), a wearable device (Wearable Device), and so on.
- the first call terminal in the embodiment of the present invention may include, but is not limited to, a voice assistant.
- the voice assistant has a speech recognition (Automatic Speech Recognition, ASR) function and/or a speech synthesis (Text-To-Speech, TTS) function. Specifically, when the voice recognition function is turned on, the voice assistant can convert voice information into text information; when the voice synthesis function is turned on, the voice assistant can convert text information into voice information.
- Fig. 1 is one of the flowcharts of the call method provided by the embodiment of the present invention. As shown in Figure 1, the call method may include the following steps:
- Step 101: When the voice assistant is turned on, obtain the first information of the target call terminal through the voice assistant.
- the first call terminal can start the voice assistant when either of the following conditions is met:
- the first condition: an input on the auxiliary call control is received
- the second condition: an input on the physical artificial intelligence (Artificial Intelligence, AI) key is received.
- the auxiliary call control may be displayed on the incoming call page and/or the call page. As shown in FIG. 2, the auxiliary call control 21 is displayed on the call page 22.
- the target call terminal may be: the first call terminal; or, the second call terminal that talks with the first call terminal.
- the form of the first information depends on which terminal is the target call terminal.
- the first information is text information.
- the user inputs text information on the screen of the first call terminal.
- the first information is voice information.
- the voice assistant can obtain the voice information of the second call terminal in the following two ways.
- the voice assistant can obtain the voice information of the second call terminal through the earpiece.
- the voice assistant directly obtains the electrical signal of the voice information from the earpiece.
- the voice assistant can obtain the voice information of the second call terminal through the microphone. In this implementation mode, the voice assistant directly obtains the voice signal from the microphone.
- Step 102: Convert the first information into second information through the voice assistant.
- the voice assistant completes the conversion of text information to voice information through the voice synthesis function.
- the second information is text information.
- the voice assistant completes the conversion of voice information to text information through the voice recognition function.
- Step 103: Output the second information. In the case where the target call terminal is the first call terminal, the first information is text information and the second information is voice information; and/or, in the case where the target call terminal is a second call terminal that is in a call with the first call terminal, the first information is voice information and the second information is text information.
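Steps 101 to 103 can be sketched as a small dispatch on the target call terminal. This is an illustrative sketch only: `Target`, `Info`, `fake_tts`, `fake_asr`, `convert`, and `handle` are hypothetical names, and the two `fake_*` functions are placeholders for the voice assistant's TTS and ASR functions, not real speech engines.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Target(Enum):
    FIRST = "first call terminal"    # the local user types text
    SECOND = "second call terminal"  # the remote party speaks


@dataclass
class Info:
    kind: str      # "text" or "voice"
    content: str


def fake_tts(text: str) -> str:
    """Hypothetical placeholder for speech synthesis (TTS)."""
    return "<audio:" + text + ">"


def fake_asr(audio: str) -> str:
    """Hypothetical placeholder for speech recognition (ASR)."""
    prefix, suffix = "<audio:", ">"
    if audio.startswith(prefix) and audio.endswith(suffix):
        return audio[len(prefix):-len(suffix)]
    return audio


def convert(target: Target, first_info: Info) -> Info:
    """Step 102: convert the first information into the second information."""
    if target is Target.FIRST:
        if first_info.kind != "text":
            raise ValueError("first call terminal supplies text information")
        return Info("voice", fake_tts(first_info.content))
    if first_info.kind != "voice":
        raise ValueError("second call terminal supplies voice information")
    return Info("text", fake_asr(first_info.content))


def handle(assistant_on: bool, target: Target, first_info: Info) -> Optional[Info]:
    """Steps 101 to 103; returns None when the voice assistant is off."""
    if not assistant_on:
        return None
    return convert(target, first_info)


print(handle(True, Target.FIRST, Info("text", "hello")))
print(handle(True, Target.SECOND, Info("voice", "<audio:hi>")))
```

In a real terminal the returned voice information would be transmitted to the second call terminal and the returned text information would be shown on the screen; the sketch only models the direction of conversion.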
- the second information is voice information.
- the output of the second information includes: outputting the second information through a target microphone; wherein, the target microphone may be the first microphone in the embodiment of the present invention, or may be another microphone of the first call terminal.
- the first call terminal can receive text information input by the user and use the voice assistant to convert it into voice information, so as to transmit the voice information to the second call terminal. In this way, even when it is inconvenient for the user of the first call terminal to speak, that user can still talk with the user of the second call terminal, so that call quality can be improved.
- the second information is text information.
- the outputting the second information includes: displaying the second information on the screen of the first call terminal.
- the area of the screen for displaying the second information may be the entire area of the screen, or may be a part of the area of the screen, which may be specifically determined according to actual needs. It should be understood that the present invention does not limit the size and position of the area for displaying the second information on the screen.
- the first call terminal can use the voice assistant to convert the voice information of the second call terminal into text information and display the text information on the screen, so that the user of the first call terminal can obtain the content expressed by the user of the second call terminal by viewing the text information. This enriches the ways in which the user of the first call terminal can obtain the content expressed by the user of the second call terminal, thereby improving call quality.
- the first call terminal enables only the voice recognition function of the voice assistant.
- the user of the first call terminal has two ways to obtain the content expressed by the user of the second call terminal: 1. listen to the voice information of the second call terminal output through the earpiece of the first call terminal; 2. view, on the screen of the first call terminal, the text information converted from the voice information of the second call terminal.
- the first call terminal obtains the content expressed by its own user by collecting voice information.
- the call flow of the first call terminal may include:
- the second voice information of the first call terminal is collected through the first microphone, and the second voice information is sent.
- the user of the first call terminal obtains the content expressed by the user of the second call terminal by listening to the voice information of the second call terminal output by the earpiece of the first call terminal.
- the first call terminal obtains the content expressed by its own user by acquiring the text information input by the user. The first call terminal then converts that text information into voice information through the voice assistant, so as to transmit the voice information to the second call terminal.
- the voice objects of the voice information are substantially different.
- the voice object of the voice information is the user of the first call terminal
- the voice object of the voice information is the first call terminal
- the call flow of the first call terminal may include:
- the fourth voice information received from the second talking terminal is output.
- the first call terminal turns on both the voice recognition function and the voice synthesis function of the voice assistant.
- the user of the first call terminal has two ways to obtain the content expressed by the user of the second call terminal: 1. listen to the voice information of the second call terminal output through the earpiece of the first call terminal; 2. view, on the screen of the first call terminal, the text information converted from the voice information of the second call terminal.
- the first call terminal obtains the content expressed by its own user by acquiring the text information input by the user. The first call terminal then converts that text information into voice information through the voice assistant, so as to transmit the voice information to the second call terminal.
- the call flow of the first call terminal may include:
- the first call terminal can convert the voice information of the second call terminal into text information through the voice assistant, so that the user of the first call terminal can obtain the content expressed by the user of the second call terminal by viewing the text information, which enriches the ways in which that content can be obtained.
- the first call terminal can receive text information input by the user and convert it into voice information through the voice assistant, so as to transmit the voice information to the second call terminal. In this way, even when it is inconvenient for the user of the first call terminal to speak, that user can still talk with the user of the second call terminal. It can be seen that the embodiments of the present invention can improve call quality.
- the second information is text information
- the outputting the second information includes:
- the method further includes:
- the text information displayed on the screen is saved.
- the text information displayed on the screen may include: text information converted from the voice information of the second call terminal.
- the text information displayed on the screen may include: text information converted from the voice information of the second call terminal, and text information input by the user of the first call terminal.
- the displaying the second information on the screen includes:
- the call page and the text page are displayed separately on the screen, and the text page is used to display the second information.
- in the split-screen display mode, the screen can display the call page and the text page at the same time. This enriches the ways in which the user can obtain the content expressed by the user of the second call terminal without hindering operation of the call page, thereby further improving call quality.
- the first call terminal can trigger the screen to enter the split-screen display mode after turning on the voice assistant, or after the text information has been obtained by conversion, but it is not limited to this.
- the first call terminal may also display the second information in full screen.
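The two display modes can be sketched as a layout function that returns the screen areas for the call page and the text page. This is a hypothetical sketch: the `Rect` type, the `layout` function, and the default 50/50 vertical split are assumptions made for illustration, since the text does not limit the size or position of the display area.

```python
from typing import NamedTuple, Optional, Tuple


class Rect(NamedTuple):
    x: int
    y: int
    width: int
    height: int


def layout(
    screen_w: int,
    screen_h: int,
    split_screen: bool = True,
    call_fraction: float = 0.5,  # assumed share of the height given to the call page
) -> Tuple[Optional[Rect], Rect]:
    """Return (call_page, text_page) areas in pixels.

    In full-screen mode the call page area is None and the text page
    (which shows the second information) takes the whole screen.
    """
    if not split_screen:
        return None, Rect(0, 0, screen_w, screen_h)
    if not 0 < call_fraction < 1:
        raise ValueError("call_fraction must be strictly between 0 and 1")
    call_h = int(screen_h * call_fraction)
    call_page = Rect(0, 0, screen_w, call_h)
    text_page = Rect(0, call_h, screen_w, screen_h - call_h)
    return call_page, text_page


print(layout(1080, 2400))
print(layout(1080, 2400, split_screen=False))
```

Keeping the call page as its own area reflects the stated goal that the text page should not hinder operation of the call page.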
- the first call terminal further includes:
- an accommodating cavity made of soundproof material
- the first earpiece is arranged outside the accommodating cavity and is used to output the voice information of the second call terminal;
- the first microphone is arranged outside the accommodating cavity and is used to collect voice information of the call environment of the first call terminal;
- the second earpiece is arranged in the accommodating cavity and is used to output the voice information of the second call terminal;
- the second microphone is arranged in the accommodating cavity and is electrically connected to the voice assistant, and is used to obtain the voice information output by the second earpiece and transmit the voice information to the voice assistant.
- both the first earpiece and the second earpiece can output the received voice information of the second call terminal.
- the voice information output by the first earpiece allows the user to obtain the content expressed by the user of the second call terminal by listening to the voice information;
- the voice information output by the second earpiece can be transmitted to the voice assistant through the second microphone, so that the voice assistant can convert the voice information into text information and the user can obtain the content expressed by the user of the second call terminal by viewing the text information.
- both the first microphone and the second microphone are used to collect voice information. The first microphone collects voice information from the external environment of the first call terminal, that is, from the call environment of the first call terminal; the second microphone collects the voice information output by the second earpiece.
- because the second earpiece and the second microphone are arranged in the accommodating cavity made of soundproof material, the quality of the voice information obtained by the voice assistant can be improved, and call quality can be improved in turn.
- the second information is text information; the method further includes at least one of the following:
- the target parameter value is used to characterize the quality of the call environment of the first call terminal.
- the above steps can be applied while the first call terminal obtains the first information of the target call terminal through the voice assistant. That is, while obtaining voice information through the voice assistant, the first call terminal may control the working state of the first microphone according to the result of comparing the target parameter value with the threshold.
- if the target parameter value is greater than the threshold, the call environment of the first call terminal is poor and noisy. Therefore, to reduce the influence of external noise on the second microphone, the first microphone can be turned off.
- the target parameter value is less than or equal to the threshold value
- the target parameter value includes a volume value output by the first earpiece.
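The threshold comparison can be sketched as follows, taking the earpiece volume as the target parameter value. The function names are hypothetical, and turning the first microphone on when the value is less than or equal to the threshold is an assumption drawn from the later embodiment, in which the external microphone is turned on again once the earpiece volume drops below the threshold.

```python
from typing import Iterable, List


def first_microphone_enabled(target_value: float, threshold: float) -> bool:
    """Return True if the first (external) microphone should be on.

    Greater than the threshold: noisy call environment, turn it off.
    Less than or equal to the threshold: assumed to be turned on.
    """
    return target_value <= threshold


def microphone_states(volumes: Iterable[float], threshold: float) -> List[bool]:
    """Apply the comparison to a sequence of earpiece volume readings."""
    return [first_microphone_enabled(v, threshold) for v in volumes]


# Hypothetical volume readings on an arbitrary 0-10 scale.
print(microphone_states([3, 9, 7, 8], threshold=7))
```

A real implementation would likely add hysteresis or debouncing so that the microphone does not toggle rapidly when the volume hovers around the threshold; the text does not specify this.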
- a compartment made of sound insulation material can be provided inside the call terminal, with a miniature dedicated call speaker and an AI-dedicated microphone placed in the compartment.
- Step 301: When an incoming call is detected, display an auxiliary call control on the incoming call (called party) interface.
- if a touch operation on the auxiliary call control is detected, the auxiliary function is used to answer; if no touch operation on the auxiliary call control is detected, the auxiliary function is not used to answer the call.
- Step 302: Detect whether the auxiliary function is enabled to answer.
- if so, step 304 is executed; otherwise, step 303 is executed.
- Step 303: Enter the normal call answering interface.
- an auxiliary call control can also be displayed on the call answering interface. If a touch operation on the auxiliary call control is detected, the auxiliary function is used to answer, and step 304 is executed.
- Step 304: Start the voice assistant and split the screen into upper and lower halves.
- the upper half is a scaled call page that retains all the buttons of the phone answering process.
- the lower half is the voice assistant wake-up interface, scaled in the same way.
- the AI-dedicated speaker and microphone inside the call terminal are turned on, and the text recognized by ASR is displayed in the lower half of the screen.
- the user can follow this call with the help of the text content in the lower half of the screen.
- to prevent noise picked up by the external microphone from affecting the internal microphone, after the auxiliary call is enabled the external microphone should be turned off when the earpiece volume is greater than the threshold, and turned on again when the earpiece volume is less than the threshold.
- Step 305: When the call ends, the call page is closed automatically, the AI-dedicated speaker and sound pickup port are turned off, and the voice assistant interface in the lower half of the screen expands to full screen; the user can choose whether to save the call content.
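The branching in steps 301 to 305 can be traced with a short sketch. The function `incoming_call_flow` and its two boolean parameters are hypothetical, and the final "end" entry for a call that never enables the auxiliary function is an assumption, since the text only describes the end of an assisted call.

```python
from typing import List


def incoming_call_flow(
    touch_on_incoming: bool,
    touch_during_call: bool = False,
) -> List[str]:
    """Return the ordered steps taken for one incoming call.

    touch_on_incoming: the auxiliary call control was touched on the
        incoming call interface (checked in step 302).
    touch_during_call: the auxiliary call control was touched later, on
        the normal call answering interface (after step 303).
    """
    steps = [
        "301: show auxiliary call control on incoming call interface",
        "302: check whether auxiliary answering is enabled",
    ]
    assisted = touch_on_incoming
    if not assisted:
        steps.append("303: enter normal call answering interface")
        assisted = touch_during_call
    if assisted:
        steps.append("304: start voice assistant, split screen, show ASR text")
        steps.append("305: close call page, expand assistant to full screen")
    else:
        # Assumption: without the auxiliary function the call ends normally.
        steps.append("end: call ends without the voice assistant")
    return steps


for step in incoming_call_flow(touch_on_incoming=False, touch_during_call=True):
    print(step)
```

The sketch shows that step 304 can be reached either directly from step 302 or later from the normal answering interface of step 303.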
- a compartment made of sound insulation material can be provided inside the call terminal, with a miniature dedicated call speaker and an AI-dedicated microphone placed in the compartment.
- Step 401: The system detects an incoming call.
- Step 402: When the incoming call is answered, enter the normal call answering interface.
- the user can wake up the AI-assisted call through the physical AI key of the mobile phone.
- Step 403: When the AI-assisted call is enabled, split the screen into upper and lower halves. The upper half is a scaled call page that retains all the buttons of the phone answering process, and the lower half is the voice assistant wake-up interface, scaled in the same way. The AI-dedicated speaker and microphone inside the call terminal are turned on, and the text recognized by ASR is displayed in the lower half of the screen.
- the user can follow this call with the help of the text content in the lower half of the screen.
- to prevent noise picked up by the external microphone from affecting the internal microphone, after the auxiliary call is enabled the external microphone should be turned off when the earpiece volume is greater than the threshold, and turned on again when the earpiece volume is less than the threshold.
- Step 404: When the call ends, the call page is closed automatically, the AI-dedicated speaker and sound pickup port are turned off, and the voice assistant interface in the lower half of the screen expands to full screen; the user can choose whether to save the call content.
- FIG. 5 is one of the structural diagrams of a terminal device provided by an embodiment of the present invention.
- the terminal device 500 is the first call terminal in the method embodiment of the present invention, and the terminal device 500 includes a voice assistant; as shown in FIG. 5, the terminal device 500 includes:
- the obtaining module 501 is configured to obtain the first information of the target call terminal through the voice assistant when the voice assistant is turned on;
- the conversion module 502 is configured to convert the first information into second information through the voice assistant
- the output module 503 is configured to output the second information
- the target call terminal is the first call terminal
- the first information is text information
- the second information is voice information
- in the case where the target call terminal is a second call terminal in a call with the first call terminal, the first information is voice information
- the second information is text information
- the second information is text information
- the output module 503 is specifically used for:
- the terminal device 500 further includes:
- the saving module is configured to save the text information displayed on the screen when the first input is received after the output module outputs the second information.
- the output module 503 is specifically used for:
- the call page and the text page are displayed separately on the screen, and the text page is used to display the second information.
- the terminal device 500 further includes:
- an accommodating cavity made of soundproof material
- the first earpiece is arranged outside the accommodating cavity and is used to output the voice information of the second call terminal;
- the first microphone is arranged outside the accommodating cavity and is used to collect voice information of the call environment of the first call terminal;
- the second earpiece is arranged in the accommodating cavity and is used to output the voice information of the second call terminal;
- the second microphone is arranged in the accommodating cavity and is electrically connected to the voice assistant, and is used to obtain the voice information output by the second earpiece and transmit the voice information to the voice assistant.
- the second information is text information
- the terminal device further includes a control module configured to perform at least one of the following:
- the target parameter value is used to characterize the quality of the call environment of the first call terminal.
- the target parameter value includes a volume value output by the first earpiece.
- the terminal device 500 can implement each process that can be implemented by the first call terminal in the method embodiment of the present invention, and achieve the same beneficial effects. To avoid repetition, details are not described herein again.
- FIG. 6 is a second structural diagram of a terminal device provided by an embodiment of the present invention, and may be a schematic diagram of a hardware structure of a first call terminal that implements various embodiments of the present invention.
- the terminal device 600 is the first call terminal in the method embodiment of the present invention, and the terminal device 600 includes a voice assistant.
- the terminal device 600 includes but is not limited to: a radio frequency unit 601, a network module 602, an audio output unit 603, an input unit 604, a sensor 605, a display unit 606, a user input unit 607, an interface unit 608, a memory 609, a processor 610, a power supply 611, and other components.
- terminal devices include, but are not limited to, mobile phones, tablet computers, notebook computers, palmtop computers, vehicle-mounted terminals, wearable devices, and pedometers.
- the processor 610 is used for:
- the target call terminal is the first call terminal
- the first information is text information
- the second information is voice information
- in the case where the target call terminal is the second call terminal in a call with the first call terminal, the first information is voice information
- the second information is text information
- the second information is text information; the processor 610 is further configured to:
- the text information displayed on the screen is saved.
- processor 610 is also used for:
- the call page and the text page are split-screen displayed on the screen by the display unit 606, and the text page is used to display the second information.
- the terminal device 600 further includes:
- an accommodating cavity, the accommodating cavity being made of sound-proof material
- the first earpiece is arranged outside the accommodating cavity and is used to output the voice information of the second call terminal;
- the first microphone is arranged outside the accommodating cavity and is used to collect voice information of the call environment of the first call terminal;
- the second earpiece is arranged in the accommodating cavity and is used to output the voice information of the second call terminal;
- the second microphone is arranged in the accommodating cavity and is electrically connected to the voice assistant, and is used to obtain the voice information output by the second earpiece and transmit the voice information to the voice assistant.
- the second information is text information; the processor 610 is further configured to:
- the target parameter value is used to characterize the quality of the call environment of the first call terminal.
- the target parameter value includes a volume value output by the first earpiece.
- the terminal device 600 in this embodiment can implement each process of the method embodiment of the present invention and achieve the same beneficial effects. To avoid repetition, details are not described herein again.
- the radio frequency unit 601 can be used to receive and send signals during information transmission or communication. Specifically, downlink data from the base station is received and sent to the processor 610 for processing; in addition, uplink data is sent to the base station.
- the radio frequency unit 601 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like.
- the radio frequency unit 601 can also communicate with the network and other devices through a wireless communication system.
- the terminal device provides users with wireless broadband Internet access through the network module 602, such as helping users to send and receive emails, browse web pages, and access streaming media.
- the audio output unit 603 can convert the audio data received by the radio frequency unit 601 or the network module 602 or stored in the memory 609 into audio signals and output them as sounds. Moreover, the audio output unit 603 may also provide audio output related to a specific function performed by the terminal device 600 (for example, call signal reception sound, message reception sound, etc.).
- the audio output unit 603 includes a speaker, a buzzer, a receiver, and the like.
- the input unit 604 is used to receive audio or video signals.
- the input unit 604 may include a graphics processing unit (GPU) 6041 and a microphone 6042, and the graphics processor 6041 processes image data of still pictures or videos obtained by an image capture device (such as a camera) in a video capture mode or an image capture mode.
- the processed image frame can be displayed on the display unit 606.
- the image frame processed by the graphics processor 6041 may be stored in the memory 609 (or other storage medium) or sent via the radio frequency unit 601 or the network module 602.
- the microphone 6042 can receive sound, and can process such sound into audio data.
- the processed audio data can be converted into a format that can be sent to the mobile communication base station via the radio frequency unit 601 for output in the case of a telephone call mode.
- the terminal device 600 also includes at least one sensor 605, such as a light sensor, a motion sensor, and other sensors.
- the light sensor includes an ambient light sensor and a proximity sensor.
- the ambient light sensor can adjust the brightness of the display panel 6061 according to the brightness of the ambient light, and the proximity sensor can turn off the display panel 6061 and/or the backlight.
- the accelerometer sensor can detect the magnitude of acceleration in various directions (usually three axes), can detect the magnitude and direction of gravity when stationary, and can be used to identify the posture of the terminal device (such as switching between horizontal and vertical screens, related games, and magnetometer attitude calibration) and for vibration-recognition functions (such as pedometer and tap detection); the sensor 605 can also include a fingerprint sensor, pressure sensor, iris sensor, molecular sensor, gyroscope, barometer, hygrometer, thermometer, infrared sensor, etc., which will not be repeated here.
- the display unit 606 is used to display information input by the user or information provided to the user.
- the display unit 606 may include a display panel 6061, and the display panel 6061 may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), etc.
- the user input unit 607 may be used to receive inputted numeric or character information, and generate key signal input related to user settings and function control of the terminal device.
- the user input unit 607 includes a touch panel 6071 and other input devices 6072.
- the touch panel 6071, also called a touch screen, can collect the user's touch operations on or near it (for example, operations performed by the user on or near the touch panel 6071 with any suitable object or accessory such as a finger or stylus).
- the touch panel 6071 may include two parts: a touch detection device and a touch controller.
- the touch detection device detects the user's touch position, detects the signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 610, and receives and executes the commands sent by the processor 610.
- the touch panel 6071 can be implemented in multiple types such as resistive, capacitive, infrared, and surface acoustic wave.
- the user input unit 607 may also include other input devices 6072.
- other input devices 6072 may include, but are not limited to, a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackball, mouse, and joystick, which will not be repeated here.
- the touch panel 6071 can cover the display panel 6061.
- when the touch panel 6071 detects a touch operation on or near it, the operation is transmitted to the processor 610 to determine the type of the touch event, and the processor 610 then provides corresponding visual output on the display panel 6061 according to the type of the touch event.
- although the touch panel 6071 and the display panel 6061 are used as two independent components to implement the input and output functions of the terminal device, in some embodiments the touch panel 6071 and the display panel 6061 can be integrated to implement the input and output functions of the terminal device, which is not specifically limited here.
- the interface unit 608 is an interface for connecting an external device and the terminal device 600.
- the external device may include a wired or wireless headset port, an external power source (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device with an identification module, audio input/output (Input/Output, I/O) port, video I/O port, headphone port, etc.
- the interface unit 608 can be used to receive input (for example, data information, power, etc.) from an external device and transmit the received input to one or more elements in the terminal device 600, or can be used to transfer data between the terminal device 600 and an external device.
- the memory 609 can be used to store software programs and various data.
- the memory 609 may mainly include a storage program area and a storage data area.
- the storage program area may store an operating system, an application program required by at least one function (such as a sound playback function, an image playback function, etc.), etc.; the storage data area may store data created through use of the mobile phone (such as audio data, a phone book, etc.).
- the memory 609 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or other volatile solid-state storage devices.
- the processor 610 is the control center of the terminal device. It uses various interfaces and lines to connect the various parts of the entire terminal device and, by running or executing software programs and/or modules stored in the memory 609 and calling data stored in the memory 609, performs the various functions of the terminal device and processes data, so as to monitor the terminal device as a whole.
- the processor 610 may include one or more processing units; preferably, the processor 610 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, application programs, etc., and the modem processor mainly handles wireless communication. It can be understood that the foregoing modem processor may also not be integrated into the processor 610.
- the terminal device 600 may also include a power source 611 (such as a battery) for supplying power to various components.
- the power source 611 may be logically connected to the processor 610 through a power management system, so that functions such as charging, discharging, and power consumption management are handled through the power management system.
- the terminal device 600 includes some functional modules not shown, which will not be repeated here.
- the embodiment of the present invention also provides a terminal device, the terminal device being a first call terminal and including a processor 610, a memory 609, and a computer program stored in the memory 609 and runnable on the processor 610; when the computer program is executed by the processor 610, each process of the above call method embodiment is realized and the same technical effect can be achieved. To avoid repetition, details are not described here again.
- the embodiment of the present invention also provides a computer-readable storage medium, and a computer program is stored on the computer-readable storage medium.
- when the computer program is executed by a processor, each process of the above call method embodiment is realized, and the same technical effect can be achieved. To avoid repetition, details are not described here again.
- the computer-readable storage medium is, for example, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk, or an optical disk.
- the disclosed device and method may be implemented in other ways.
- the device embodiments described above are merely illustrative.
- the division of the units is only a logical function division, and there may be other divisions in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not implemented.
- the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
- the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
- the functional units in the various embodiments of the present disclosure may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
- the technical solution of the present invention, in essence or in the part that contributes to the existing technology, can be embodied in the form of a software product; the computer software product is stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and includes a number of instructions to enable a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to execute the method described in each embodiment of the present invention.
- the program can be stored in a computer-readable storage medium and, when executed, may include the procedures of the above-mentioned method embodiments.
- the storage medium may be a magnetic disk, an optical disk, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), etc.
- modules, units, and sub-units can be implemented in one or more Application Specific Integrated Circuits (ASIC), Digital Signal Processors (DSP), Digital Signal Processing Devices (DSPD), Programmable Logic Devices (PLD), Field-Programmable Gate Arrays (FPGA), general-purpose processors, controllers, microcontrollers, microprocessors, other electronic units used to execute the functions described in the present disclosure, or combinations thereof.
- the technology described in the embodiments of the present disclosure can be implemented by modules (for example, procedures, functions, etc.) that perform the functions described in the embodiments of the present disclosure.
- the software codes can be stored in the memory and executed by the processor.
- the memory can be implemented in the processor or external to the processor.
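The conversion rule described above (text to voice when the target call terminal is the first call terminal, voice to text when it is the second call terminal) can be sketched in Python as follows. This is only an illustrative sketch: the names `CallParty`, `SecondInfo`, `convert`, `fake_tts`, and `fake_stt` are hypothetical and do not appear in the patent, and the speech engines are replaced by trivial string stubs.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable


class CallParty(Enum):
    """Which side of the call the first information comes from (hypothetical type)."""
    FIRST = "first call terminal"    # the local device that hosts the voice assistant
    SECOND = "second call terminal"  # the remote party in the call


@dataclass(frozen=True)
class SecondInfo:
    kind: str     # "voice" or "text"
    payload: str  # converted content


def convert(target: CallParty, first_info: str,
            tts: Callable[[str], str],
            stt: Callable[[str], str]) -> SecondInfo:
    """Convert first information into second information for the given target."""
    if target is CallParty.FIRST:
        # Local user's text information becomes voice information.
        return SecondInfo("voice", tts(first_info))
    # Remote party's voice information becomes text information.
    return SecondInfo("text", stt(first_info))


# Assumed stand-ins for real text-to-speech and speech-to-text engines.
def fake_tts(text: str) -> str:
    return "<audio:" + text + ">"


def fake_stt(audio: str) -> str:
    return audio[len("<audio:"):-1]
```

In a real terminal, `tts` and `stt` would be whatever speech synthesis and recognition the voice assistant provides; the sketch only shows how the direction of conversion follows from the target call terminal.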
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Telephone Function (AREA)
Claims (14)
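- A call method, applied to a first call terminal, characterized in that the first call terminal includes a voice assistant; the method includes: in the case where the voice assistant is enabled, obtaining first information of a target call terminal through the voice assistant; converting the first information into second information through the voice assistant; and outputting the second information; wherein, in the case where the target call terminal is the first call terminal, the first information is text information and the second information is voice information; and/or, in the case where the target call terminal is a second call terminal in a call with the first call terminal, the first information is voice information and the second information is text information.
- The method according to claim 1, characterized in that the second information is text information; the outputting the second information includes: displaying the second information on a screen; after the outputting the second information, the method further includes: in the case where a first input is received, saving the text information displayed on the screen.
- The method according to claim 2, characterized in that the displaying the second information on a screen includes: displaying a call page and a text page in split screen on the screen, the text page being used to display the second information.
- The method according to any one of claims 1 to 3, characterized in that the first call terminal further includes: an accommodating cavity, the accommodating cavity being made of sound-proof material; a first earpiece, arranged outside the accommodating cavity and used to output the voice information of the second call terminal; a first microphone, arranged outside the accommodating cavity and used to collect voice information of the call environment of the first call terminal; a second earpiece, arranged in the accommodating cavity and used to output the voice information of the second call terminal; and a second microphone, arranged in the accommodating cavity and electrically connected to the voice assistant, used to obtain the voice information output by the second earpiece and transmit the voice information to the voice assistant.
- The method according to claim 4, characterized in that the second information is text information; the method further includes at least one of the following: in the case where a target parameter value is greater than a threshold, controlling the first microphone to be in an off state; in the case where the target parameter value is less than or equal to the threshold, controlling the first microphone to be in an on state; wherein the target parameter value is used to characterize the quality of the call environment of the first call terminal.
- The method according to claim 5, characterized in that the target parameter value includes a volume value output by the first earpiece.
- A terminal device, the terminal device being a first call terminal, characterized in that the terminal device includes a voice assistant; the terminal device includes: an obtaining module, used to obtain first information of a target call terminal through the voice assistant in the case where the voice assistant is enabled; a conversion module, used to convert the first information into second information through the voice assistant; and an output module, used to output the second information; wherein, in the case where the target call terminal is the first call terminal, the first information is text information and the second information is voice information; and/or, in the case where the target call terminal is a second call terminal in a call with the first call terminal, the first information is voice information and the second information is text information.
- The terminal device according to claim 7, characterized in that the second information is text information; the output module is specifically used to: display the second information on a screen; the terminal device further includes: a saving module, used to save the text information displayed on the screen in the case where a first input is received after the output module outputs the second information.
- The terminal device according to claim 8, characterized in that the output module is specifically used to: display a call page and a text page in split screen on the screen, the text page being used to display the second information.
- The terminal device according to any one of claims 7 to 9, characterized in that the terminal device further includes: an accommodating cavity, the accommodating cavity being made of sound-proof material; a first earpiece, arranged outside the accommodating cavity and used to output the voice information of the second call terminal; a first microphone, arranged outside the accommodating cavity and used to collect voice information of the call environment of the first call terminal; a second earpiece, arranged in the accommodating cavity and used to output the voice information of the second call terminal; and a second microphone, arranged in the accommodating cavity and electrically connected to the voice assistant, used to obtain the voice information output by the second earpiece and transmit the voice information to the voice assistant.
- The terminal device according to claim 10, characterized in that the second information is text information; the terminal device further includes a control module, used to perform at least one of the following: in the case where a target parameter value is greater than a threshold, controlling the first microphone to be in an off state; in the case where the target parameter value is less than or equal to the threshold, controlling the first microphone to be in an on state; wherein the target parameter value is used to characterize the quality of the call environment of the first call terminal.
- The terminal device according to claim 11, characterized in that the target parameter value includes a volume value output by the first earpiece.
- A terminal device, the terminal device being a first call terminal, characterized by including a processor, a memory, and a computer program stored in the memory and runnable on the processor, wherein the computer program, when executed by the processor, implements the steps of the call method according to any one of claims 1 to 6.
- A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and the computer program, when executed by a processor, implements the steps of the call method according to any one of claims 1 to 6.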
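The first-microphone control recited in claims 5 and 11 (off when the target parameter value exceeds a threshold, on otherwise) can be illustrated with a short Python sketch. The names `FirstMicrophone` and `update_first_microphone` are hypothetical, and the numeric values in the usage lines are arbitrary assumptions; the claims only require at least one of the two branches, while this sketch implements both.

```python
from dataclasses import dataclass


@dataclass
class FirstMicrophone:
    """Hypothetical model of the microphone outside the accommodating cavity."""
    enabled: bool = True


def update_first_microphone(mic: FirstMicrophone,
                            target_value: float,
                            threshold: float) -> bool:
    """Switch the first microphone according to the target parameter value.

    The target parameter value characterizes the quality of the call
    environment; per claims 6 and 12 it may include the volume value
    output by the first earpiece.
    """
    # Greater than the threshold: off. Less than or equal: on.
    mic.enabled = target_value <= threshold
    return mic.enabled


mic = FirstMicrophone()
update_first_microphone(mic, target_value=80.0, threshold=60.0)  # mic is switched off
update_first_microphone(mic, target_value=60.0, threshold=60.0)  # mic is switched back on
```

Treating the boundary case (value equal to the threshold) as "on" follows the wording "less than or equal to the threshold" in the claims.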
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911155039.XA CN110913070B (zh) | 2019-11-22 | 2019-11-22 | Call method and terminal device |
CN201911155039.X | 2019-11-22 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021098708A1 (zh) | 2021-05-27 |
Family
ID=69818857
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/129662 WO2021098708A1 (zh) | Call method and terminal device | 2019-11-22 | 2020-11-18 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN110913070B (zh) |
WO (1) | WO2021098708A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113660375A (zh) * | 2021-08-11 | 2021-11-16 | 维沃移动通信有限公司 | Call method and apparatus, and electronic device |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110913070B (zh) * | 2019-11-22 | 2021-11-23 | 维沃移动通信有限公司 | Call method and terminal device |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
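CN101510917A (zh) * | 2009-03-11 | 2009-08-19 | 宇龙计算机通信科技(深圳)有限公司 | Method for silent calls on a mobile terminal, and mobile terminal |
CN101610465A (zh) * | 2008-06-18 | 2009-12-23 | 朗讯科技公司 | Communication method and communication system for converting text information into voice information |
CN103973877A (zh) * | 2013-02-06 | 2014-08-06 | 北京壹人壹本信息科技有限公司 | Method and apparatus for real-time calls using text in a mobile terminal |
CN104285428A (zh) * | 2012-05-08 | 2015-01-14 | 三星电子株式会社 | Method and system for operating a communication service |
CN104869225A (zh) * | 2014-02-21 | 2015-08-26 | 宏达国际电子股份有限公司 | Intelligent dialogue method and electronic device using the method |
KR101609585B1 (ko) * | 2014-11-28 | 2016-04-06 | 박지선 | Mobile communication terminal for the hearing impaired |
CN106131288A (zh) * | 2016-08-25 | 2016-11-16 | 深圳市金立通信设备有限公司 | Method for recording call information, and terminal |
CN106412259A (zh) * | 2016-09-14 | 2017-02-15 | 广东欧珀移动通信有限公司 | Mobile terminal call control method and apparatus, and mobile terminal |
CN108769363A (zh) * | 2018-04-13 | 2018-11-06 | 珠海市魅族科技有限公司 | Call method and apparatus, computer apparatus, and computer-readable storage medium |
CN110913070A (zh) * | 2019-11-22 | 2020-03-24 | 维沃移动通信有限公司 | Call method and terminal device |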
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20110051385A (ko) * | 2009-11-10 | 2011-05-18 | 삼성전자주식회사 | Communication terminal and communication method thereof |
CN102710539A (zh) * | 2012-05-02 | 2012-10-03 | 中兴通讯股份有限公司 | Voice information transmission method and apparatus |
CN105847580A (zh) * | 2016-05-04 | 2016-08-10 | 浙江吉利控股集团有限公司 | System and method for voice reminders of third-party incoming calls |
CN107103899B (zh) * | 2017-04-24 | 2020-06-19 | 北京小米移动软件有限公司 | Method and apparatus for outputting voice messages |
CN109036404A (zh) * | 2018-07-18 | 2018-12-18 | 北京小米移动软件有限公司 | Voice interaction method and apparatus |
- 2019-11-22 CN CN201911155039.XA patent/CN110913070B/zh active Active
- 2020-11-18 WO PCT/CN2020/129662 patent/WO2021098708A1/zh active Application Filing
Also Published As
Publication number | Publication date |
---|---|
CN110913070B (zh) | 2021-11-23 |
CN110913070A (zh) | 2020-03-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
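WO2021098678A1 (zh) | | Screen projection control method and electronic device |
US11848773B2 (en) | | Transmit antenna switching method and terminal device |
WO2021078116A1 (zh) | | Video processing method and electronic device |
WO2020238635A1 (zh) | | Mobile terminal and sound outlet switching method |
WO2021109907A1 (zh) | | Application sharing method, first electronic device, and computer-readable storage medium |
US11635939B2 (en) | | Prompting method and mobile terminal |
US20200257433A1 (en) | | Display method and mobile terminal |
WO2019201271A1 (zh) | | Call processing method and mobile terminal |
WO2021129529A1 (zh) | | Device switching method and related device |
CN111638779A (zh) | | Audio playback control method and apparatus, electronic device, and readable storage medium |
WO2021063249A1 (zh) | | Control method for electronic device, and electronic device |
CN108551534B (zh) | | Method and apparatus for multi-terminal voice calls |
WO2021098633A1 (zh) | | Information display and sending methods, and electronic device |
WO2020220990A1 (zh) | | Receiver control method and terminal |
WO2020199986A1 (zh) | | Video call method and terminal device |
WO2019206077A1 (zh) | | Video call processing method and mobile terminal |
WO2021190545A1 (zh) | | Call processing method and electronic device |
WO2021109959A1 (zh) | | Application program sharing method and electronic device |
WO2021129835A1 (zh) | | Volume control method, device, and computer-readable storage medium |
CN108712566A (zh) | | Voice assistant wake-up method and mobile terminal |
WO2021238844A1 (zh) | | Audio output method and electronic device |
WO2021098698A1 (zh) | | Audio playback method and terminal device |
WO2021098708A1 (zh) | | Call method and terminal device |
WO2021197311A1 (zh) | | Volume adjustment display method and electronic device |
WO2021169869A1 (zh) | | Audio playback apparatus, audio playback method, and electronic device |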
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20890599 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20890599 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 01.03.2023) |