WO2018137306A1 - Procédé et dispositif de déclenchement d'une fonction vocale - Google Patents

Procédé et dispositif de déclenchement d'une fonction vocale Download PDF

Info

Publication number
WO2018137306A1
WO2018137306A1 PCT/CN2017/087984 CN2017087984W WO2018137306A1 WO 2018137306 A1 WO2018137306 A1 WO 2018137306A1 CN 2017087984 W CN2017087984 W CN 2017087984W WO 2018137306 A1 WO2018137306 A1 WO 2018137306A1
Authority
WO
WIPO (PCT)
Prior art keywords
terminal device
interface content
contact
information
interface
Prior art date
Application number
PCT/CN2017/087984
Other languages
English (en)
Chinese (zh)
Inventor
王培�
何小文
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to CN201780004960.7A priority Critical patent/CN108605074B/zh
Publication of WO2018137306A1 publication Critical patent/WO2018137306A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725Cordless telephones

Definitions

  • the present invention relates to the field of communications, and in particular, to a method and device for triggering a voice function.
  • the user needs to obtain the content displayed on the user interface (UI) of the terminal, and the user can browse the UI through the eyes, but the manner of obtaining is single, and when the user is inconvenient to operate the UI, the page is flipped as above.
  • the user is not able to quickly browse the content presented by the UI, and only can view the current UI interface.
  • a multi-touch method for the viewing and operation of the content displayed on the UI, a button manner, or a gesture recognition manner may be adopted.
  • Multi-touch mode when users browse the content displayed on the UI, they need to constantly interact with the interface to continue to view the content; while the buttons and gestures can only complete a specific one for each gesture. Operation, at the time of the specific operation, browsing the UI content does not only include one operation; therefore, the user needs to learn and memorize the operations corresponding to a large number of gestures.
  • the above method has low operation efficiency, and improves user learning and memory burden, and reduces user experience.
  • Embodiments of the present invention relate to a method and apparatus for triggering a voice function. To solve the problem that the user is in the process of browsing the UI, the operation efficiency is low, and the user needs to learn and memorize a large number of gestures to complete the browsing UI.
  • the embodiment of the present invention provides a method for triggering a voice function, where the method includes: acquiring, by the terminal device, interface content presented on an interface of the terminal device; and detecting that the working mode of the terminal device is switched to the first mode, The terminal device converts the interface content into voice information for voice output, and the first mode is a working mode in which the terminal device accesses the earphone or activates the earphone function of the terminal device.
  • the voice function of the terminal device when the working mode of the terminal device is switched to the first mode, the voice function of the terminal device can be triggered, and the interface content of the terminal device is converted into voice information for output.
  • the operation efficiency is improved, and the operation of the user to learn and memorize a large number of gestures is reduced, and the user experience is improved.
  • the method further comprises: the terminal device determining the type of the interface element in the interface content.
  • the type of the interface element in the interface content includes one or more of text information, picture information, contact name, phone number, and contact avatar.
  • the type of interface element in the interface content includes picture information including one or more of text information, contact name, phone number, and contact avatar.
  • the interface content includes text information; and the terminal device converts the interface content into voice information for output, including: converting the text information into voice information, and performing voice output.
  • the interface content includes one or more of a contact name, a phone number, and a contact image; and the terminal device converts the interface content into voice information for output, including: the terminal device according to the contact name, The contact number associated with one or more of the phone number and the contact image is dialed.
  • the method before the step of converting the interface content into voice information for voice output, the method further includes: detecting whether the interface content includes system voice information of the terminal device; if the interface content does not include the terminal device The system language information converts the interface content into the target voice information; or converts the interface content into language information corresponding to the user's needs.
  • the embodiment of the present invention provides a terminal device, where the terminal device includes: an acquiring unit, configured to acquire interface content presented on an interface of the terminal device; and a detecting unit, configured to detect a working mode of the terminal device; The detecting unit detects that the working mode of the terminal device is switched to the first mode, and the executing unit converts the interface content of the terminal device into voice information, and then performs voice output by the output unit.
  • the first mode is an operating mode in which the terminal device accesses the headset or activates the headset function of the terminal device.
  • the execution unit converts the interface content of the terminal device into voice information, and then outputs the output by the output unit.
  • the embodiment of the invention can improve the operation efficiency, reduce the operation of the user to learn and memorize a large number of gestures, and improve the user experience.
  • the processing unit is configured to determine the type of interface element in the interface content.
  • the type of the interface element in the interface content includes one or more of text information, picture information, contact name, phone number, and contact avatar.
  • the type of the interface element in the interface content includes picture information including one or more of text information, contact name, phone number, and contact avatar.
  • the interface content includes text information
  • converting the interface content of the terminal device into voice information for output including: converting the text information into voice information, and performing voice output.
  • the interface content includes one or more of a contact name, a phone number, and a contact image; converting the interface content into voice information for output, including: according to the contact name, the phone number, and One or more associated contacts in the contact image are dialed.
  • the method before the step of converting the interface content into voice information for voice output, the method further includes: detecting whether the interface content includes system voice information of the terminal device; if the interface content does not include system language information of the terminal device, The interface content of the terminal device is converted into the target voice information; or the interface content is converted into language information corresponding to the user requirement.
  • the terminal device further includes an input unit, and the input unit is configured to receive the voice input of the user during the voice interaction between the terminal device and the user.
  • the embodiment of the present invention provides another terminal device, where the terminal device includes: a memory, configured to store program instructions, and a processor, configured to perform the following operations according to the program instructions stored in the memory: acquiring the terminal device Interface content presented on the interface; if it is detected that the working mode of the terminal device is switched to the first mode, the interface content is converted into voice information for voice output, and the first mode is the work of the terminal device accessing the earphone or starting the earphone function of the terminal device mode.
  • the processor is further configured to: according to the program instructions stored in the memory, determine the type of the interface element in the interface content after the step of acquiring the interface content presented on the interface of the terminal device.
  • the type of the interface element in the interface content includes one or more of text information, picture information, contact name, phone number, and contact avatar.
  • the type of the interface element in the interface content includes: picture information including one or more of text information, contact name, phone number, and contact avatar.
  • the processor is configured to: according to the program instructions stored in the memory, perform the following operations: converting the interface content into the voice information for output, including: converting the text information into the voice information , for voice output.
  • the processor is configured to perform the following operations according to the program instructions stored in the memory: converting the interface content
  • Outputting voice information includes dialing based on a contact associated with one or more of a contact name, a phone number, and a contact image.
  • the processor is further configured to: according to the program instructions stored in the memory, detecting whether the interface content includes system voice information of the terminal device before converting the interface content into voice information for voice output; If the interface content does not include the system language information of the terminal device, the interface content is converted into the target voice information; or the interface content is converted into the language information corresponding to the user requirement.
  • the method and device for triggering a voice function provided by the embodiment of the present invention switch the working mode of the terminal device to the first mode, and convert the interface content into voice information for voice output, and the first mode is the terminal device accessing.
  • the embodiment of the invention can improve the operation efficiency, reduce the operation of the user to learn and memorize a large number of gestures, and improve the user experience.
  • FIG. 1 is a schematic structural diagram of a terminal device according to an embodiment of the present disclosure
  • FIG. 2 is a schematic flowchart of a method for triggering a voice function according to an embodiment of the present invention
  • FIG. 3 is a possible implementation manner of a method for triggering a voice function according to an embodiment of the present disclosure
  • 4a-4d are another possible implementation manner of triggering a voice function according to an embodiment of the present invention.
  • 5a-5c are still another possible implementation manner of triggering a voice function according to an embodiment of the present invention.
  • 6a-6e are still another possible implementation manner of triggering a voice function according to an embodiment of the present invention.
  • FIG. 7 is a possible implementation manner of converting interface content into voice information according to an embodiment of the present disclosure.
  • FIG. 8 is a schematic structural diagram of still another terminal device according to an embodiment of the present invention.
  • FIG. 1 is a schematic structural diagram of a terminal device according to an embodiment of the present invention.
  • the terminal device may include a processor 180, a memory 120, a radio frequency (RF) circuit 110, and a peripheral system 170. These components can communicate over one or more communication buses 210.
  • RF radio frequency
  • peripheral system 170 can include other device controllers 171, sensor controllers 172, and display controllers 173. Each controller may be coupled to a respective peripheral device (eg, other input device 130, sensor 150, display 140). It should be noted that the peripheral system 170 may also include other I/O peripherals.
  • the display screen 140 can be used to display information input by a user or to present information to a user, for example, various menus of the terminal device, interfaces for displaying a running application, such as a button, and a text input box (Text) can be displayed. , Scroll Bar, Menu, and more.
  • the display panel 140 may include a display panel 141 and a touch panel 142.
  • the display panel 141 may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED), or the like.
  • the touch panel 142 can cover the display panel.
  • the touch panel 142 When the touch panel 142 detects a touch operation on or near the touch panel 142, it transmits to the processor 180 to determine the type of the touch event, and then the processor 180 is based on the type of the touch event.
  • a corresponding visual output is provided on display panel 141.
  • the touch panel 142 and the display panel 141 are two independent components to implement the input and output functions of the terminal device. However, in some embodiments, the touch panel 142 can be integrated with the display panel 141 to implement input of the terminal device. And output function.
  • a radio frequency (RF) circuit 110 is used to receive and transmit radio frequency signals, primarily integrating the receiver and transmitter of the terminal device. Radio frequency (RF) circuitry 110 communicates with the communication network and other communication devices via radio frequency signals.
  • radio frequency (RF) circuitry 110 may include, but is not limited to: an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC chip, a SIM Cards and storage media, etc.
  • radio frequency (RF) circuitry 110 can be implemented on a separate chip.
  • wireless transmission can be performed through the radio frequency circuit B03, such as Bluetooth transmission, Wireless-Fidelity (WI-FI) transmission, and third-generation mobile communication technology (3rd-Generation, 3G) transmission.
  • WI-FI Wireless-Fidelity
  • 3G third-generation mobile communication technology
  • 4G 4th Generation mobile communication
  • the audio circuit 160 is used for single item MP3 audio streaming and two-way voice transmission over the network.
  • the audio circuit 160 can include a speaker 161 and a microphone 162.
  • Memory 120 is coupled to processor 180 for storing various software programs and/or sets of instructions.
  • memory 120 may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices, or other non-volatile solid state storage devices.
  • the memory 120 can store an operating system (hereinafter referred to as a system) such as an embedded operating system such as ANDROID, IOS, WINDOWS, or LINUX.
  • the memory 1120 can also store a network communication program that can be used to communicate with one or more additional devices, one or more terminal devices, one or more network devices.
  • the memory 120 can also store a user interface program, which can realistically display the content of the application through a graphical operation interface, and receive user control operations on the application through input controls such as menus, dialog boxes, and keys. .
  • terminal device is only an example provided by the embodiment of the present invention, and that the terminal device may have more or less components than the illustrated components, may combine two or more components, or may have components. Different configurations are implemented.
  • the above is a typical terminal device structure diagram, of course, based on different device forms can be increased or decreased on the basis, such as no audio circuit, no speakers, no microphone, no RF circuit, no other input Equipment; it is also possible to add, for example, a WIFI circuit, a Bluetooth circuit, an infrared circuit, and the like.
  • FIG. 2 is a flowchart of a method for triggering a voice function according to an embodiment of the present invention. As shown in FIG. 2, the method for triggering a voice function may include the following steps:
  • Step 201 The terminal device acquires interface content presented on an interface of the terminal device.
  • the terminal device can obtain the interface content presented on the interface of the terminal device by using the system software of the terminal device, and the system software is various independent hardware in the terminal device, so that they can work in coordination.
  • the interface content presented on the interface of the terminal device is referred to as the interface content of the terminal device.
  • the method for triggering the voice function further comprises: determining a type of the interface element in the interface content of the terminal device.
  • the terminal device determines the format of the interface element in the interface content, thereby determining the type of the interface element.
  • the format of the interface element includes: a format of the text (.TXT), a format of the image (.JPG), and after determining the format of the interface element in the interface content, the terminal device may determine that the type of the interface element in the interface content corresponds to the text.
  • Information picture information.
  • the type of the interface element in the interface content of the terminal device includes one or more of text information, picture information, contact name, phone number, and contact avatar.
  • the type of the interface element in the interface content of the terminal device includes picture information, and the picture information includes one or more of a contact name, a phone number, and a contact avatar.
  • Step 202 If it is detected that the working mode of the terminal device is switched to the first mode, the terminal device converts the interface content of the terminal device into voice information for voice output, and the first mode is that the terminal device accesses the earphone or activates the earphone function of the terminal device. Operating mode.
  • the interface content of the terminal device includes, but is not limited to, text information, picture information, contact name, phone number, and contact avatar.
  • the interface content of the terminal device includes text information. If it is detected that the working mode of the terminal device is switched to the first mode, the terminal device converts the text information into the voice information, and the interface content of the terminal device by the terminal device Conduct a voice announcement.
  • the text information is taken as an example in FIG.
  • the textual information may include, but is not limited to, a text interface, a picture interface displaying text.
  • the interface content of the terminal device includes picture information. If it is detected that the working mode of the terminal device is switched to the first mode, the terminal device converts the picture information into voice information, and the interface content of the terminal device by the terminal device Conduct a voice announcement.
  • the interface content of the terminal device may include one or more of a contact name, a phone number, and a contact avatar.
  • the interface content of the terminal device may include a contact name, or a phone number, or a contact avatar. If it is detected that the terminal device switches to the first mode, the terminal device performs a dialing operation according to a contact name on the user interface, or a phone number, or a contact associated with a contact avatar. Only the contact 1 is taken as an example in Fig. 4a. If it is detected that the terminal device switches to the first mode, the terminal device performs a dialing operation on the contact associated with the contact 1.
  • the terminal device can also implement the dialing operation by the contact associated with the contact avatar by using the foregoing manner.
  • the user interface of the terminal device includes a contact name corresponding to at least two phone numbers; or a contact avatar corresponding to at least two phone numbers.
  • a contact name corresponding to at least two phone numbers
  • a contact avatar corresponding to at least two phone numbers.
  • two telephone numbers corresponding to one contact are described as an example. If it is detected that the terminal device switches to the first mode, the terminal device selects one of the two phone numbers corresponding to a contact to perform a dialing operation.
  • the terminal device selects one of the two phone numbers corresponding to a contact to perform the dialing operation, and may include the following methods:
  • the first way the manufacturer can set up before the terminal device leaves the factory, or set by the user after leaving the factory. For example, the first bit of the object is sorted by default.
  • the telephone number corresponding to contact 1 is telephone number 1 (xxxx-xxxxxxxx) and telephone number 2 (xxx-xxxx-xxxx).
  • the telephone number 1 is sorted in the first place, and the terminal device performs a dialing operation according to the contact associated with the telephone number 1. With the default settings, it is very suitable for the operation of the terminal device. It is also relatively simple to operate.
  • the second way the user can press the button on the headset connected to the terminal device.
  • one of the at least two phone numbers corresponding to a contact name may be selected through a volume button on the headset to perform a dialing operation.
  • the volume down button sequentially selects one of the at least two phone numbers corresponding to one contact in the order from first to last to perform a dialing operation;
  • the volume up button selects a contact in order from back to front.
  • One of the corresponding at least two telephone numbers performs a dialing operation.
  • the dialing operation of the telephone number 1 can be selected by the volume up key of the earphone.
  • the third way is: performing voice interaction with the user through the earphone, and the terminal device selects one of the at least two phone numbers corresponding to the contact to perform a dialing operation by querying the user.
  • the terminal device knows that the user needs to make a call with the phone number 1 corresponding to the contact 1 by querying the user, and the terminal device performs a dialing operation according to the phone number 1.
  • the user's experience is emphasized and the user's experience is improved.
  • the contact avatar corresponds to at least two phone numbers, and one of the at least two phone numbers corresponding to the contact avatar may be implemented by the above manner. The number is dialed.
  • the interface content of the terminal device may include at least two contact names, each contact name corresponding to one phone number; or, at least two phone numbers; or, at least two contact avatars, each The contact avatar corresponds to a phone number.
  • three contact names are described as an example. If it is detected that the terminal device switches to the first mode, the terminal device selects a phone number corresponding to one of the three contact names to perform a dialing operation.
  • the terminal device selects the phone number corresponding to one of the three contact names to perform the dialing operation, and may include the following methods:
  • the first way the manufacturer can set up before the terminal device leaves the factory, or set by the user after leaving the factory.
  • the contact associated with the contact name of the first place by default is dialed.
  • the contact name sorted in the first place is the contact 1
  • the terminal device performs a dialing operation according to the contact associated with the contact 1.
  • the default settings it is very suitable for the operation of the terminal device, and the operation is relatively simple.
  • the second way the user can press the button on the headset connected to the terminal device.
  • one of the three contacts can be selected for dialing operation via the volume button on the headset.
  • the volume down button selects contact 1, contact 2, and contact 3 in order from first to last;
  • the volume up button selects contact 3, contact 2, and contact in order from back to front. 1.
  • the user selects the contact 1 through the volume button, and the terminal device performs a dialing operation according to the contact associated with the contact 1.
  • the third way is: performing voice interaction with the user through the earphone, and the terminal device selects one of the three contacts to perform a dialing operation by asking the user.
  • the terminal device receives an indication that the user wants to dial the contact 1 by interacting with the user voice, and the terminal device performs a dialing operation according to the contact associated with the contact 1.
  • the user's experience is emphasized and the user's experience is improved.
  • the terminal device may also select a contact associated with one of the at least two contact avatars to perform a dialing operation.
  • the user content of the terminal device includes at least two contacts, each contact corresponding to at least two phone numbers; or, at least two contact avatars, each contact avatar corresponding to at least two phones number.
  • two contacts, and two phone numbers corresponding to each contact are described as an example. If it is detected that the terminal device switches to the first mode, the terminal device selects a phone number corresponding to one of the two contacts to perform a dialing operation.
  • the terminal device selects a phone number corresponding to one of the two contacts to perform the dialing operation, and may include the following methods:
  • the first way the manufacturer can set up before the terminal device leaves the factory, or set by the user after leaving the factory. For example, by default, the first-order object is sorted for dialing.
  • the telephone number corresponding to the contact 1 is the telephone number 1 (xxxx-xxxxxxxx) and the telephone number 2 (xxx-xxxx-xxxx)
  • the telephone number corresponding to the contact 2 is the telephone number 3 (xxxx-xxxxxxxx) ) and phone number 4 (xxx-xxxx-xxxx).
  • the telephone number 1 corresponding to the contact 1 is sorted in the first place, and the terminal device performs a dialing operation according to the telephone number 1. With the default settings, it is very suitable for the operation of the terminal device, and the operation is relatively simple.
  • the second way the user can press the button on the headset connected to the terminal device.
  • the object can be dialed by the volume button on the headset.
  • the volume down key selects the objects in order from the first to the last to perform the dialing operation;
  • the volume up key selects the objects in the order from the back to the front to perform the dialing operation.
  • Figure 4d Assume that the user selects the phone number 1 corresponding to the contact through the volume button, and the terminal device performs a dialing operation according to the phone number 1.
  • the third way is: performing voice interaction with the user through the earphone, and the terminal device selects the object to perform a dialing operation by asking the user.
  • the terminal device selects the object to perform a dialing operation by asking the user.
  • the user needs to make a call with the phone number 1 corresponding to the contact 1, and the terminal device receives the instruction of the user and performs a dialing operation according to the phone number 1.
  • the user's experience is emphasized and the user's experience is improved.
  • the interface content of the terminal device includes at least two contact avatars, and the contact avatar corresponds to at least two phone numbers, the contact avatar corresponding to at least two contact avatars may be implemented in the foregoing manner.
  • the interface content of the terminal device includes a contact name and a phone number; or, a phone number and a contact avatar; or, a contact name and a contact avatar; or, a contact name, a phone number, and a contact Human avatar.
  • Each of the above contact names, each contact avatar corresponds to a phone number.
  • the contact name and contact avatar are described as an example. If it is detected that the terminal device switches to the first mode, the terminal device selects a contact name or a contact associated with the contact avatar to perform a dialing operation.
  • the terminal device selects the contact name or the contact associated with the contact avatar to perform the dialing operation.
  • the first method the production merchant can be set before the terminal device leaves the factory, or set by the user after leaving the factory. For example, the default sorting of objects in the first order performs a dialing operation.
  • the contact names are sorted in the first order, and the terminal device performs a dialing operation according to the contact associated with the contact name.
  • the second way through the buttons on the headset connected to the terminal device.
  • the volume down key selects the objects in order from the first to the last to perform the dialing operation
  • the volume up key selects the objects in the order from the back to the front to perform the dialing operation.
  • the contact name can be selected through the volume button on the headset, and the terminal device performs a dialing operation according to the contact associated with the contact name.
  • the third way is: performing voice interaction with the user through the earphone, and the terminal device selects a dialing operation on the object by asking the user.
  • the terminal is set up by asking the user, and the user instructs the terminal device to communicate with the voice. Person 1 makes a call.
  • the terminal device performs a dialing operation according to the contact associated with the contact 1. By interacting with the user's voice, the user's experience is emphasized and the user's experience is improved.
  • the interface content of the terminal device includes a contact name and a phone number; or, a phone number and a contact avatar; or, a contact name and a contact avatar; or, a contact name, a phone number, and a contact Avatar.
  • Each of the contact names and contact avatars described above corresponds to at least two phone numbers.
  • the contact name and the contact avatar, and the contact avatar and the contact avatar respectively correspond to two phone numbers as an example. If it is detected that the terminal device switches to the first mode, the terminal device selects a contact number or a phone number corresponding to the contact avatar to perform a dialing operation.
  • the terminal device selects a contact name or a phone number corresponding to the contact avatar to perform the dialing operation, and may include the following methods:
  • the first way the manufacturer can set up before the terminal device leaves the factory, or set by the user after leaving the factory. For example, the first bit of the object is sorted by default.
  • the telephone number corresponding to the contact 1 is the telephone number 1 (xxxx-xxxxxxxx) and the telephone number 2 (xxx-xxxx-xxxx), and the telephone number corresponding to the contact avatar is the telephone number 3 (xxxx-xxxxxxxx).
  • phone number 4 (xxx-xxxx-xxxx).
  • the telephone number 1 corresponding to the contact 1 is sorted in the first place, and the terminal device performs a dialing operation according to the telephone number 1. With the default settings, it is very suitable for the operation of the terminal device, and the operation is relatively simple.
  • the second way the user can press the button on the headset connected to the terminal device.
  • the object can be dialed by the volume button on the headset.
  • the volume down key selects the objects in order from the first to the last to perform the dialing operation;
  • the volume up key selects the objects in the order from the back to the front to perform the dialing operation.
  • Figure 5c Assume that the user selects the phone number 1 corresponding to the contact 1 through the volume button, and the terminal device performs a dialing operation according to the phone number 1.
  • the third way is: performing voice interaction with the user through the earphone, and the terminal device selects the object to perform a dialing operation by asking the user.
  • the terminal device selects the object to perform a dialing operation by asking the user.
  • FIG. 5c by interacting with the user, the user needs to make a call with the phone number 1 corresponding to the contact 1, and the terminal device receives the user's instruction and performs a dialing operation according to the phone number 1.
  • the user's experience is emphasized and the user's experience is improved.
  • the interface content of the terminal device includes text information, picture information, contact name or phone number, or contact avatar.
  • the interface content of the terminal device includes picture information, and one or more of a contact name, a phone number, and a contact avatar.
  • the interface content of the terminal device includes text information, and one or more of a contact name, a phone number, and a contact avatar. As shown in Figure 6a.
  • text information is converted to speech.
  • the user may be queried by voice when encountering the contact name, or after the interface content is read, the voice may be queried whether the user wants to perform a dialing operation, if the user needs to be associated with the contact name.
  • the contact makes a call, and the terminal device performs a dialing operation according to the contact associated with the contact name.
  • the interface content of the terminal device includes text information and contact name, each contact The person's name corresponds to a phone number; or, text information and phone number; or, text information and contact avatar, each contact avatar corresponds to a phone number.
  • the text information and the contact name are explained as an example. If it is detected that the terminal device switches to the first mode, the terminal device can perform voice interaction with the user, and when the user gives an indication of the text information to voice, the terminal device converts the text information into voice; or, the user gives an indication to the dialing The terminal device performs a dialing operation according to the contact associated with the contact name.
  • the interface content of the terminal device includes text information and a contact name, each contact name corresponding to at least two phone numbers; or, text information and a phone number; or, text information and a contact avatar, Each contact avatar corresponds to at least two phone numbers.
  • the text information and the contact 1 the contact 1 corresponding phone number 1 (xxxx-xxxxxx) and the telephone number 2 (xxx-xxxx-xxxx) are described as an example.
  • the terminal device may perform voice interaction with the user, and when the user gives an indication of the text information to voice, the terminal device converts the text information into voice; or, the user gives the contact 1
  • the telephone number 1 is instructed to dial, and the terminal device performs a dialing operation based on the telephone number 1.
  • the interface content of the terminal device includes text information, a contact name, and a phone number, and each contact name corresponds to a phone number; or, text information, a phone number, and a contact avatar, each contact The avatar corresponds to a phone number; or, the text message, the contact name, and the contact avatar, each contact name and each contact avatar respectively correspond to a phone number; or, text information, contact name, phone number, and contact person The avatar, each contact name and each contact avatar respectively correspond to a phone number.
  • text information, contact name, phone number, and contact avatar are described as an example.
  • the terminal device can perform voice interaction with the user, and when the user gives an indication of the text information to voice, the terminal device converts the text information into voice; or, the user gives the phone number to dial Instructed by the terminal device to perform a dialing operation based on the contact associated with the telephone number.
  • the interface content of the terminal device includes text information, a contact name, and a phone number, each contact name corresponding to at least two phone numbers; or, text information, a phone number, and a contact avatar, each The contact avatar corresponds to at least two phone numbers; or, the text information, the contact name, and the contact avatar, each contact name and each contact avatar respectively correspond to at least two phone numbers; or, text information, contact name , phone number and contact avatar, each contact name and each contact avatar respectively correspond to at least two phone numbers.
  • the contact information 1 corresponding to the telephone number 1 (xxxx-xxxxxx) and the telephone number 2 (xxx-xxxx-xxxx) are described as an example with text information, a contact 1, a telephone number, and a contact avatar. If it is detected that the terminal device is switched to the first mode, the terminal device may perform voice interaction with the user, and when the user gives an indication of the text information to voice, the terminal device converts the text information into voice; or, the user gives the contact 1 The telephone number 1 is instructed to dial, and the terminal device performs a dialing operation based on the telephone number 1.
  • the interface content of the terminal device includes picture information including one or more of text information, contact name, phone number, and contact avatar. If the detecting terminal device switches to the first mode, the terminal device converts the text information in the picture information to voice information, and the terminal device performs a dialing operation according to the contact name, the phone number, and the contact associated with the contact avatar.
  • the picture information includes text information, as well as a contact name or phone number or contact picture.
  • the picture information includes a contact name or a phone number or a contact picture
  • each contact corresponds to one phone number or multiple phone numbers; or, at least two contact avatars, each contact avatar corresponding to one phone number or multiple phone numbers Or, select one of the at least two phone numbers to dial, according to the default settings of the terminal device system; or, use the volume button of the headset; or, use voice communication.
  • each contact corresponds to one phone number or multiple phone numbers; or, at least two contact avatars, each contact avatar corresponding to one phone number or multiple phone numbers
  • select one of the at least two phone numbers to dial according to the default settings of the terminal device system; or, use the volume button of the headset; or, use voice communication.
  • the contact name may include a contact that has been stored in the terminal device; and may include, but is not limited to, an account contact that is registered with a phone number, such as instant messaging, or is bound to a phone number.
  • the contact avatar may include: an avatar set by the contact stored by the user in the terminal device; and may include, but is not limited to, an avatar of the account contact registered with the phone number or bound with the phone number, such as instant messaging.
  • Instant messaging refers to a service that can send and receive Internet messages and the like.
  • the first mode refers to an operation mode in which the terminal device accesses the earphone or the earphone function inside the terminal device.
  • the method before the step of the terminal device converting the interface content of the terminal device into the voice information for voice output, the method further includes: the terminal device detecting whether the interface content includes the voice information of the terminal device system; if the interface content does not include the terminal The language information of the device system converts the interface content into the target voice information; or converts the interface content into language information corresponding to the user requirement.
  • the terminal device converts the interface content of the terminal device into the voice information of the terminal device system according to the default setting of the terminal device; or the terminal device communicates with the user. If the user issues an instruction to perform translation, in one case, the user gives the target language, the terminal device converts the interface content of the terminal device into the target language, and in another case, the user does not give the target voice, and the terminal device connects the terminal.
  • the interface content of the device is converted into voice information of the terminal device system; if the user does not release the translation instruction, the interface content of the terminal device is directly converted into voice information.
  • the language information of the interface content of the terminal device is consistent with the language information of the terminal device system.
  • the system language of the terminal device and the voice of the text information are all in simplified Chinese.
  • the terminal device can directly convert the interface content of the terminal device into text information without performing the conversion operation.
  • the language information may refer to a voice, and may also refer to a type of a language, for example, English, Chinese, and the like. Or a combination of the two.
  • Step 203 The terminal device performs a corresponding operation according to the interface content of the terminal device.
  • the terminal device when detecting that the working mode of the terminal device is switched to the first mode, detects the interface content of the terminal device, and converts the interface content of the terminal device into voice information for voice output, the first mode.
  • the working mode of the headset function for the terminal device to enter the headset or to activate the terminal device.
  • the operation mode of the terminal device is detected to be switched to the first mode, the operation of detecting the interface of the terminal device is performed. Compared with detecting that the working mode of the terminal device is switched to the first mode, the power consumption of the terminal device is saved.
  • the voice function of the terminal device when the working mode of the terminal device is switched to the first mode, the voice function of the terminal device can be triggered, and the interface content of the terminal device is converted into voice information for output.
  • the operation efficiency is improved, and the operation of the user to learn and memorize a large number of gestures is reduced, and the user experience is improved.
  • FIG. 8 is a schematic structural diagram of still another terminal device according to an embodiment of the present invention. As shown in FIG. 8, the terminal device includes an acquisition unit 810, a detection unit 820, an execution unit 830, an output unit 840, and a processing unit 850.
  • Figure 8 only shows a simplified design of the structure of the terminal device.
  • the terminal structure shown in FIG. 8 does not constitute a limitation to the terminal, and the terminal device may include more or less components than the illustration 8, for example, the terminal device may further include instructions for storing corresponding instructions of the communication algorithm. Storage unit.
  • the acquiring unit 810 is configured to acquire the interface content presented on the interface of the terminal device; the detecting unit 820 is configured to detect the working mode of the terminal device; and if the detecting unit 820 detects that the working mode of the terminal device is switched to the first mode, The executing unit 830 converts the interface content of the terminal device into voice information, and then performs output of the voice by the output unit 850.
  • the first mode is an operating mode in which the terminal device accesses the headset or activates the headset function of the terminal device.
  • the detecting unit 820 converts the interface content of the terminal device into voice information, and then outputs the output by the output unit 840. .
  • the embodiment of the invention can improve the operation efficiency, reduce the operation of the user to learn and memorize a large number of gestures, and improve the user experience.
  • the processing unit is configured to determine the type of interface element in the interface content.
  • the type of the interface element in the interface content includes one or more of text information, picture information, contact name, phone number, and contact avatar.
  • the type of the interface element in the interface content includes picture information including one or more of text information, contact name, phone number, and contact avatar.
  • the interface content of the terminal device includes text information
  • converting the interface content of the terminal device into voice information for output including: converting the text information into voice information, and performing voice output.
  • the interface content of the terminal device includes one or more of a contact name, a phone number, and a contact image; converting the interface content of the terminal device into voice information for output, including: according to the contact The contact associated with one or more of the person's name, phone number, and contact image is dialed.
  • the method before the step of converting the interface content into voice information for voice output, the method further includes: detecting whether the interface content includes system voice information of the terminal device; if the interface content does not include system language information of the terminal device, The interface content of the terminal device is converted into the target voice information; or the interface content is converted into language information corresponding to the user requirement.
  • the terminal device further includes an input unit 860, configured to receive a voice input of the user during the voice interaction between the terminal device and the user.
  • the method and device for triggering a voice function are provided in the embodiment of the present invention.
  • the working mode of the terminal device is switched to the first mode, and the interface content of the terminal device is converted into voice information for voice output.
  • the first mode is that the terminal device accesses the earphone or
  • the working mode of the headset function of the terminal device is activated.
  • the embodiment of the invention can improve the operation efficiency, reduce the operation of the user to learn and memorize a large number of gestures, and improve the user experience.
  • non-transitory media such as random access memory, read only memory, flash memory, hard disk, solid state disk, magnetic tape, floppy disk, optical disc, and any combination thereof.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)

Abstract

Un mode de réalisation de la présente invention concerne un procédé et un dispositif de déclenchement d'une fonction vocale. Le procédé consiste à : acquérir, par un dispositif terminal, un contenu d'interface affiché sur une interface du dispositif terminal ; et si un mode de fonctionnement du dispositif terminal est détecté comme étant commuté dans un premier mode, convertir le contenu d'interface en informations vocales, et délivrer en sortie une voix, le premier mode étant un mode de fonctionnement dans lequel le dispositif terminal est connecté à un casque d'écoute ou active une fonction de casque d'écoute de celui-ci. Le mode de réalisation de la présente invention peut améliorer l'efficacité de fonctionnement et réduire les opérations correspondant à un grand nombre de gestes à apprendre et mémoriser par un utilisateur, ce qui permet d'améliorer l'expérience de l'utilisateur.
PCT/CN2017/087984 2017-01-26 2017-06-12 Procédé et dispositif de déclenchement d'une fonction vocale WO2018137306A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201780004960.7A CN108605074B (zh) 2017-01-26 2017-06-12 一种触发语音功能的方法和设备

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710061841.7 2017-01-26
CN201710061841 2017-01-26

Publications (1)

Publication Number Publication Date
WO2018137306A1 true WO2018137306A1 (fr) 2018-08-02

Family

ID=62978868

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/087984 WO2018137306A1 (fr) 2017-01-26 2017-06-12 Procédé et dispositif de déclenchement d'une fonction vocale

Country Status (2)

Country Link
CN (1) CN108605074B (fr)
WO (1) WO2018137306A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113835669B (zh) * 2020-06-24 2024-03-29 青岛海信移动通信技术有限公司 电子设备及其语音播报方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104184867A (zh) * 2014-08-29 2014-12-03 广东欧珀移动通信有限公司 一种智能移动终端来电语音播报联系人信息的方法及系统
CN104461545A (zh) * 2014-12-12 2015-03-25 百度在线网络技术(北京)有限公司 将移动终端中内容提供至用户的方法及装置
CN104461346A (zh) * 2014-10-20 2015-03-25 天闻数媒科技(北京)有限公司 一种视障人士触控屏幕的方法、装置及智能触屏移动终端
CN105657174A (zh) * 2016-01-26 2016-06-08 努比亚技术有限公司 一种语音转换方法和终端
CN105955609A (zh) * 2016-04-25 2016-09-21 乐视控股(北京)有限公司 一种语音阅读的方法及装置

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7650170B2 (en) * 2004-03-01 2010-01-19 Research In Motion Limited Communications system providing automatic text-to-speech conversion features and related methods
EP2761860B1 (fr) * 2011-09-30 2019-10-23 Apple Inc. Interfaces utilisateur s'adaptant automatiquement pour permettre une interaction sans les mains
CN102857637B (zh) * 2012-09-03 2016-03-23 小米科技有限责任公司 一种联系人信息获取方法、系统及装置
CN104104767B (zh) * 2013-04-07 2018-05-01 腾讯科技(深圳)有限公司 便携智能终端中联系人信息的处理方法和装置
CN104123114A (zh) * 2013-04-27 2014-10-29 腾讯科技(深圳)有限公司 一种进行语音播放的方法和装置
CN104142778B (zh) * 2013-09-25 2017-06-13 腾讯科技(深圳)有限公司 一种文本处理的方法、装置及移动终端
US9538226B2 (en) * 2013-12-06 2017-01-03 Samsung Electronics Co., Ltd. Method for operating moving pictures and electronic device thereof
CN103747511B (zh) * 2014-01-07 2018-03-09 加一联创电子科技有限公司 信息播报方法和系统
CN104346038B (zh) * 2014-09-24 2018-05-01 广东欧珀移动通信有限公司 终端信息读取方法和系统
CN104469027A (zh) * 2014-10-31 2015-03-25 百度在线网络技术(北京)有限公司 呼叫处理方法和装置
WO2016178984A1 (fr) * 2015-05-01 2016-11-10 Ring-A-Ling, Inc. Procédés et systèmes pour la gestion de vidéo et de tonalités entre dispositifs mobiles
CN105208232B (zh) * 2015-10-10 2018-11-20 网易(杭州)网络有限公司 一种自动拨打电话的方法和装置
CN105791502A (zh) * 2016-04-28 2016-07-20 北京小米移动软件有限公司 联系人查找方法及装置
CN106339160A (zh) * 2016-08-26 2017-01-18 北京小米移动软件有限公司 浏览交互处理方法及装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104184867A (zh) * 2014-08-29 2014-12-03 广东欧珀移动通信有限公司 一种智能移动终端来电语音播报联系人信息的方法及系统
CN104461346A (zh) * 2014-10-20 2015-03-25 天闻数媒科技(北京)有限公司 一种视障人士触控屏幕的方法、装置及智能触屏移动终端
CN104461545A (zh) * 2014-12-12 2015-03-25 百度在线网络技术(北京)有限公司 将移动终端中内容提供至用户的方法及装置
CN105657174A (zh) * 2016-01-26 2016-06-08 努比亚技术有限公司 一种语音转换方法和终端
CN105955609A (zh) * 2016-04-25 2016-09-21 乐视控股(北京)有限公司 一种语音阅读的方法及装置

Also Published As

Publication number Publication date
CN108605074A (zh) 2018-09-28
CN108605074B (zh) 2021-01-05

Similar Documents

Publication Publication Date Title
CN108021305B (zh) 应用关联启动的方法、装置及移动终端
WO2020042785A1 (fr) Procédé d'affichage d'application et terminal mobile
WO2017197650A1 (fr) Procédé et dispositif pour une interaction dans un appel
US10956025B2 (en) Gesture control method, gesture control device and gesture control system
CN108089891B (zh) 一种应用程序启动方法、移动终端
JP2021525430A (ja) 表示制御方法及び端末
JP7403648B2 (ja) 同期方法及び電子機器
WO2021129529A1 (fr) Procédé de commutation de dispositifs et dispositif associé
WO2019201271A1 (fr) Procédé de traitement d'appel et terminal mobile
US8620392B2 (en) Electronic device capable of continuing a telephone call when charging
CN111327458A (zh) 配置信息分享方法、终端设备及计算机可读存储介质
JP2021536077A (ja) 情報処理方法および端末
WO2013149530A1 (fr) Procédé d'affichage d'informations, terminal mobile et support à mémoire lisible par ordinateur
WO2020238445A1 (fr) Procédé d'enregistrement d'écran et terminal
WO2013135169A1 (fr) Procédé de réglage de saisie de clavier et terminal portable associé
WO2015043200A1 (fr) Procédé et appareil pour commander des applications et des opérations sur un terminal
WO2021037074A1 (fr) Procédé de sortie audio et appareil électronique
WO2020020213A1 (fr) Procédé d'entrée d'informations et terminal
WO2020063107A1 (fr) Procédé de capture d'écran et terminal
WO2020024770A1 (fr) Procédé pour déterminer un objet de communication, et terminal mobile
CN109683768B (zh) 一种应用的操作方法及移动终端
CN111104029A (zh) 快捷标识生成方法、电子设备及介质
WO2023284621A1 (fr) Procédé et appareil de réglage, dispositif électronique et support de stockage
US20150088525A1 (en) Method and apparatus for controlling applications and operations on a terminal
CN110769303A (zh) 播放控制方法、装置及移动终端

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17894126

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17894126

Country of ref document: EP

Kind code of ref document: A1