WO2019041512A1 - Method and device for enabling voice recognition function of terminal, earphone and terminal - Google Patents

Method and device for enabling voice recognition function of terminal, earphone and terminal

Info

Publication number
WO2019041512A1
WO2019041512A1 PCT/CN2017/108583 CN2017108583W WO2019041512A1 WO 2019041512 A1 WO2019041512 A1 WO 2019041512A1 CN 2017108583 W CN2017108583 W CN 2017108583W WO 2019041512 A1 WO2019041512 A1 WO 2019041512A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
terminal
voice recognition
recognition function
earphone
Prior art date
Application number
PCT/CN2017/108583
Other languages
French (fr)
Chinese (zh)
Inventor
陈大鹏
Original Assignee
歌尔科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 歌尔科技有限公司
Publication of WO2019041512A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00 Reducing energy consumption in communication networks
    • Y02D30/70 Reducing energy consumption in communication networks in wireless communication networks

Definitions

  • the present invention relates to the field of voice recognition technology, and in particular, to a method, device, earphone and terminal for enabling a voice recognition function of a terminal.
  • the use convenience of the terminal can be improved to some extent.
  • Once the voice recognition function of the terminal is turned on, the terminal continuously and actively acquires sound from the surrounding environment in real time through its microphone, regardless of whether the user is using the function, and it recognizes the acquired sound whenever its intensity exceeds a threshold. Keeping the voice recognition function of the terminal normally on therefore causes a large energy loss.
  • The terminal has a limited energy supply. Therefore, most existing terminals with a voice recognition function keep the function normally off, and the user can use it only after manually turning it on through a physical button or a virtual button.
  • A physical button occupies space in the terminal and increases its volume, which makes the terminal less convenient to carry, and it is prone to failure after repeated pressing, which affects use and degrades the user experience. Moreover, when the user's hands are occupied, the user cannot manually turn on the voice recognition function through a physical or virtual button, which further affects use. To avoid being unable to turn on the voice recognition function when both hands are occupied, the user may instead keep the function normally on. However, as described above, because the energy of the terminal is limited, keeping the function normally on drains a large amount of the terminal's energy, which easily leaves the terminal with insufficient power, affects use of the terminal, and in turn affects use of the function.
  • An object of the present invention is to provide a method, a device, an earphone and a terminal for enabling a voice recognition function of a terminal, which can improve the convenience of use of the voice recognition function and further improve the user experience.
  • The present invention provides a method for enabling a voice recognition function of a terminal through an earphone, the method including:
  • an open command for turning on the voice recognition function is sent to the terminal to enable the terminal to perform voice recognition.
  • the method for opening the voice recognition function of the terminal through the earphone further includes:
  • When it is detected that the voice transmission link is connected, voice is transmitted to the terminal through the voice transmission link.
  • the method for enabling the terminal voice recognition function by using the earphone further includes:
  • Sending the open command for turning on the voice recognition function to the terminal is specifically:
  • the open command is sent to the terminal through an instruction channel based on the BLE protocol.
  • the present invention further provides an apparatus for enabling a voice recognition function of a terminal through a headset, and the apparatus for enabling a voice recognition function of a terminal by using a headset includes:
  • the opening module is configured to, when determining that the voice includes a preset keyword, send an open command to enable the voice recognition function to enable the terminal to perform voice recognition.
  • The present invention also provides an earphone, the earphone including a memory and a processor, the processor executing any of the above methods for enabling a voice recognition function of a terminal through an earphone by calling instructions stored in the memory.
  • the present invention further provides a method for enabling a voice recognition function of a terminal, where the method for enabling a voice recognition function of the terminal includes:
  • the speech recognition function is turned on for speech recognition.
  • the method for enabling the voice recognition function of the terminal further includes:
  • the method for enabling the voice recognition function of the terminal further includes:
  • the preset shutdown condition specifically includes:
  • the signal strength of the voice is lower than the first preset value and the duration is up to the second preset value;
  • the signal strength of the voice is higher than the third preset value, and the duration is up to the fourth preset value;
  • the first preset value is smaller than the third preset value.
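  • A minimal sketch of how such a shutdown check could be evaluated. The text does not say whether the two conditions are alternatives, what units the preset values use, or how the signal is sampled, so this sketch assumes that either condition alone triggers shutdown, that durations are in seconds, and that the input is a time-ordered sequence of (timestamp, signal strength) pairs. The names `ShutdownThresholds` and `should_shut_down` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ShutdownThresholds:
    low_level: float      # first preset value (signal strength)
    low_duration: float   # second preset value (assumed to be seconds)
    high_level: float     # third preset value (signal strength)
    high_duration: float  # fourth preset value (assumed to be seconds)

    def __post_init__(self):
        # The text requires the first preset value to be smaller than the third.
        if not self.low_level < self.high_level:
            raise ValueError("first preset value must be smaller than the third")


def should_shut_down(samples, thresholds):
    """Return True once either condition has held continuously long enough.

    samples: time-ordered iterable of (timestamp_seconds, signal_strength).
    """
    low_since = None   # start of the current run below low_level
    high_since = None  # start of the current run above high_level
    for timestamp, level in samples:
        if level < thresholds.low_level:
            if low_since is None:
                low_since = timestamp
            if timestamp - low_since >= thresholds.low_duration:
                return True
        else:
            low_since = None
        if level > thresholds.high_level:
            if high_since is None:
                high_since = timestamp
            if timestamp - high_since >= thresholds.high_duration:
                return True
        else:
            high_since = None
    return False
```

  • In this sketch a run is reset as soon as a single sample falls back inside the band between the first and third preset values; a real implementation might instead smooth the signal first.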
  • The present invention also provides a device for enabling a voice recognition function of a terminal, the device including:
  • a receiving module configured to receive an open command, sent by the earphone, for turning on the voice recognition function;
  • an opening module configured to turn on the voice recognition function to perform voice recognition.
  • The present invention also provides a terminal, the terminal including a memory and a processor, the processor executing the above method for enabling a voice recognition function of a terminal by calling instructions stored in the memory.
  • The method for enabling a voice recognition function of a terminal through an earphone includes collecting voice and, when it is determined that the voice includes a preset keyword, sending an open command for turning on the voice recognition function to the terminal so that the terminal performs voice recognition. With this method, while the voice recognition function of the terminal is off, no manual operation is needed: a voice containing the preset keyword is enough to have the open command sent to the terminal, which turns on the voice recognition function of the terminal to perform voice recognition.
  • The method provided by the present invention thus recognizes the keyword at the earphone and automatically turns on the voice recognition function of the terminal, so the terminal does not need to keep the voice recognition function on for a long time. This not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
  • The present invention also provides a device for enabling the voice recognition function of a terminal through an earphone, and an earphone, both with the same effects as above.
  • the present invention also provides a method for turning on the voice recognition function of the terminal, comprising: receiving an opening instruction of the voice recognition function sent by the earphone; and opening the voice recognition function to perform voice recognition.
  • the method provided by the present invention can be used in the state that the voice recognition function of the terminal is off, without manual operation, and the terminal can open its own voice recognition function for voice recognition only by receiving an open command sent by the earphone.
  • the method can not only reduce the power consumption of the terminal, but also improve the convenience of the user using the voice recognition function, thereby further improving the user experience.
  • The present invention also provides a device for enabling a voice recognition function of a terminal, and a terminal, both with the same effects as above.
  • FIG. 1 is a flowchart of a method for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention
  • FIG. 2 is a flowchart of another method for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention
  • FIG. 3 is a flowchart of another method for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention
  • FIG. 4 is a structural diagram of an apparatus for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention
  • FIG. 5 is a flowchart of a method for enabling a voice recognition function of a terminal according to an embodiment of the present invention
  • FIG. 6 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention.
  • FIG. 7 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention.
  • FIG. 8 is a structural diagram of an apparatus for enabling a voice recognition function of a terminal according to an embodiment of the present invention.
  • FIG. 9 is a schematic diagram of an application scenario of a headset and a terminal according to an embodiment of the present invention.
  • An object of the present invention is to provide a method for voice recognition, which can improve the convenience of a user using a voice recognition function, thereby improving the user experience.
  • The earphone mentioned in the present invention needs to establish a communication connection with the terminal. The communication module used to establish the connection may be a Bluetooth module or another type of communication module, as long as the communication modules of the earphone and the terminal match.
  • The terminal may be a mobile phone or another type of electronic product.
  • FIG. 1 is a flowchart of a method for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention. As shown in Figure 1, the method includes:
  • The voice collection module of the earphone collects the voice input by the user in real time, and the local voice recognition module of the earphone then determines whether the collected voice includes a preset keyword; if it does, the earphone sends an open command to the terminal.
  • Specifically, the earphone recognizes the preset keyword as follows: it converts the collected voice into text and compares that text with the text of the preset keyword stored in the database. If the converted text contains text identical to the preset keyword, the voice collected by the earphone is determined to include the preset keyword; otherwise, it is determined not to include the preset keyword.
  • the preset keyword mentioned in step S11 refers to a keyword that turns on the voice recognition function of the terminal.
  • The content of the preset keyword is not limited and may consist of letters, numbers, Chinese characters, and so on. The more content the preset keyword includes, the slower the recognition process, but the more effectively erroneous operation is prevented. The keyword can be chosen according to the actual situation, and the invention does not limit it.
  • For example, suppose the preset keyword is "turn on the voice recognition function". When the user says to the earphone "I want to turn on the voice recognition function" (a voice that includes the preset keyword), the first voice collection module of the earphone collects the voice and the local voice recognition module of the earphone recognizes it. Once the voice is determined to include the preset keyword, the earphone sends an open command for turning on the voice recognition function to the terminal, and the terminal, after receiving the open command, turns on the voice recognition function to perform voice recognition.
  • There may be one or more preset keywords for enabling the voice recognition function. If there are several, the database includes the text of each of them, and the user can turn on the voice recognition function of the terminal by inputting a voice containing any one of the preset keywords.
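  • The text comparison described above can be sketched as a substring check over the text produced by the earphone's speech-to-text step. This is only an illustration: the function name `find_preset_keyword` is hypothetical, the speech-to-text step is assumed to have already produced a string, and the case and whitespace normalization is an added assumption that the source does not mention.

```python
def find_preset_keyword(recognized_text, preset_keywords):
    """Return the first preset keyword contained in the text, or None."""

    def normalize(text):
        # Assumption: ignore letter case and collapse runs of whitespace.
        return " ".join(text.lower().split())

    haystack = normalize(recognized_text)
    for keyword in preset_keywords:
        if normalize(keyword) in haystack:
            return keyword
    return None
```

  • With several preset keywords in the list, any one of them is enough to produce a match, which mirrors the multi-keyword case described above.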
  • Two preset keywords may also be set, one representing the local voice recognition function of the terminal and the other representing the network voice recognition function of the terminal.
  • In that case the open command carries corresponding information. For example, the header of the data corresponding to the open command is set to 1 to indicate that the local voice recognition function of the terminal is to be used, and set to 0 to indicate that network voice recognition is to be used.
  • When the earphone receives a voice that includes the keyword for the local voice recognition function of the terminal, it sends the terminal an open command for turning on the local voice recognition function; when the earphone receives a voice that includes the keyword for the network voice recognition function of the terminal, it sends the terminal an open command for turning on the network voice recognition function.
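  • A minimal sketch of an open command whose header distinguishes the two recognition modes. Only the header values 1 (local) and 0 (network) come from the text; treating the header as the first byte, the two-byte layout, the `OPEN_OPCODE` value, and all names are assumptions made for illustration.

```python
from enum import IntEnum


class RecognitionMode(IntEnum):
    NETWORK = 0  # header value 0: network voice recognition
    LOCAL = 1    # header value 1: local voice recognition of the terminal


OPEN_OPCODE = 0xA1  # hypothetical byte marking the payload as an open command


def encode_open_command(mode):
    """Build the payload: header byte first, then the hypothetical opcode."""
    return bytes([int(mode), OPEN_OPCODE])


def decode_open_command(payload):
    """Return the requested mode; raise ValueError for anything else."""
    if len(payload) != 2 or payload[1] != OPEN_OPCODE:
        raise ValueError("not an open command")
    return RecognitionMode(payload[0])  # raises ValueError for unknown headers
```

  • The earphone would call `encode_open_command` with the mode that matches the keyword it heard, and the terminal would call `decode_open_command` on receipt to decide which recognition function to turn on.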
  • The method for enabling the voice recognition function of the terminal through the earphone provided by this embodiment works while the voice recognition function of the terminal is off and needs no manual operation: a voice containing the preset keyword is enough to have the open command sent to the terminal, which turns on the voice recognition function of the terminal to perform voice recognition. The method therefore not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
  • The voice to be recognized that the terminal obtains through its own microphone can be of poor quality, which lowers the success rate of voice recognition and affects the user.
  • The user can move the terminal closer so that it obtains a better-quality voice to be recognized, but the terminal has poor portability.
  • The present invention takes into account that the earphone has better portability.
  • The terminal can collect a higher-quality voice to be recognized through the earphone. Therefore, after the earphone sends the open command to the terminal, the terminal can also establish a voice transmission link with the earphone, so that the terminal obtains the high-quality voice to be recognized collected by the portable earphone through the voice transmission link.
  • FIG. 2 is a flowchart of another method for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention.
  • As shown in FIG. 2, so that the earphone transmits the high-quality voice to be recognized that it collects to the terminal through the voice transmission link, the method further includes, after step S11 is performed:
  • The voice transmission link is a transmission link between the terminal and the earphone for transmitting the voice to be recognized. The voice mentioned in step S21 is the voice collected by the earphone after it has sent the open command and detected that the voice transmission link is connected; it is collected at a different time from the voice mentioned in step S10. The voice in step S10 is spoken by the user before the voice transmission link is connected, whereas the voice in step S21 is spoken by the user after the link is connected.
  • If, at a given moment, the earphone does not detect that the voice transmission link is connected, the voice it collects in real time is the voice of step S10. If, at the next moment, the earphone detects that the voice transmission link is connected, the voice it collects in real time is the voice of step S21.
  • When the earphone receives the terminal's request to establish a voice transmission link, it responds immediately so that the link between the two is established as soon as possible. When the earphone detects that the voice transmission link between itself and the terminal is connected, it starts collecting voice in real time and transmits the voice to the terminal in real time through the voice transmission link.
  • In step S21, if the user inputs the voice to be recognized at the wrong time, voice recognition may fail. For example, if the user starts speaking before the terminal has fully turned on the voice recognition function or before the voice transmission link has been established, the terminal cannot obtain the complete voice to be recognized, and voice recognition fails. Therefore, to help the user time the input of the voice to be recognized correctly, and thereby improve the success rate of voice recognition and the user experience, the following approach may be adopted in other embodiments.
  • When the earphone detects that the voice transmission link is connected and receives the prompt command sent by the terminal, it plays a prompt signal telling the user to start inputting the voice to be recognized, which helps the user time the input accurately. The earphone then transmits the voice it collects to the terminal through the voice transmission link, so that the terminal obtains the whole voice to be recognized, which improves the success rate of voice recognition and the user experience.
  • With the method provided by this embodiment, the earphone, which has better portability, responds to the terminal's request to establish a voice transmission link and, once the link is detected to be connected, transmits the voice it collects from the user to the terminal in real time through the link. The terminal thus obtains a high-quality voice to be recognized through the earphone, which avoids the harm to the user experience that arises when the terminal cannot obtain a high-quality voice through its own microphone and therefore fails to recognize what the user wants recognized. The method therefore improves the success rate of voice recognition and the user experience.
  • After the terminal successfully obtains the voice to be recognized or successfully obtains the recognition result, the terminal closes the voice transmission link. At that point, the earphone can stop transmitting the collected voice to the terminal to reduce its own power consumption. The embodiment of the present invention therefore provides another method for enabling the voice recognition function of the terminal through the earphone, which further improves on FIG. 2 and is described in detail below with reference to the accompanying drawings.
  • FIG. 3 is a flowchart of another method for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention. As shown in FIG. 3, to reduce the power consumption of the earphone itself, as a preferred embodiment, on the basis of FIG. 2, after step S21 is performed, the method further includes:
  • The earphone still collects voice in real time. At this point, the earphone uses the collected voice to determine whether it includes the preset keyword, so that the user can turn on the voice recognition function of the terminal again by voice.
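  • The earphone-side cycle described so far (keyword detection, open command, link establishment, optional prompt, forwarding, and return to keyword detection once the link is closed) can be sketched as a small state machine. This is an illustrative sketch, not the patented implementation: the state names, method names, and action strings are hypothetical, and the radio and audio operations are replaced by entries appended to a list.

```python
from enum import Enum, auto


class State(Enum):
    KEYWORD_LISTENING = auto()  # collected voice is checked for the preset keyword
    AWAITING_LINK = auto()      # open command sent, link not yet connected
    STREAMING = auto()          # collected voice is forwarded to the terminal


class EarphoneController:
    def __init__(self):
        self.state = State.KEYWORD_LISTENING
        self.actions = []  # record of what the earphone would do

    def on_keyword_detected(self):
        if self.state is State.KEYWORD_LISTENING:
            self.actions.append("send_open_command")
            self.state = State.AWAITING_LINK

    def on_link_request(self):
        # The earphone answers the terminal's request without delay.
        if self.state is State.AWAITING_LINK:
            self.actions.append("accept_link_request")

    def on_link_connected(self, prompt_requested=False):
        if self.state is State.AWAITING_LINK:
            if prompt_requested:
                self.actions.append("play_prompt")
            self.state = State.STREAMING

    def on_voice_frame(self, frame):
        # Frames are forwarded only while the link is connected.
        if self.state is State.STREAMING:
            self.actions.append(("forward_to_terminal", frame))

    def on_link_closed(self):
        if self.state is State.STREAMING:
            self.actions.append("stop_forwarding")
            self.state = State.KEYWORD_LISTENING
```

  • Events that arrive in the wrong state are ignored in this sketch, so a voice frame collected before the link is connected is never forwarded to the terminal.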
  • Within one complete cycle, the collected voice serves different purposes: voice collected before the open command is sent to the terminal is used to determine whether the preset keyword is included, and voice collected after the open command is sent is transmitted to the terminal for voice recognition. Therefore, in a specific implementation, the earphone includes two collection modules, a first collection module and a second collection module, which do not work at the same time.
  • To improve the user experience, after step S30 is performed, the earphone can also receive the audio in the recognition result sent by the terminal and play the audio.
  • step S11 the method for opening the voice recognition function of the terminal through the earphone provided by the embodiment of the present invention
  • step S30 is continued, More details will be described.
  • Sending the open command for turning on the voice recognition function to the terminal is specifically: sending the open command to the terminal through a command channel based on the BLE protocol.
  • Communication protocols other than BLE may also be used; they are not described in this embodiment.
  • The foregoing describes in detail embodiments of the method for enabling a voice recognition function of a terminal through a headset. The present invention further provides a device, corresponding to the method, for enabling a voice recognition function of a terminal through a headset.
  • Because the embodiments of the device part correspond to the embodiments of the method part, refer to the description of the method embodiments for the device embodiments; the details are not repeated here.
  • FIG. 4 is a structural diagram of an apparatus for enabling a voice recognition function of a terminal through a headset according to an embodiment of the present invention. As shown in Figure 4, the device comprises:
  • the acquisition module 40 is configured to collect voice.
  • the opening module 41 is configured to send an opening instruction for turning on the voice recognition function to the terminal to perform voice recognition when determining that the voice includes the preset keyword.
  • The collection module 40 includes a first voice collection module and a second voice collection module. Both collect voice in real time, but the voice collected by the first voice collection module is recognized by the local voice recognition module of the earphone, whereas the voice collected by the second voice collection module is transmitted by the earphone to the terminal for the terminal to recognize.
  • The first voice collection module is kept on to collect voice in real time, so that the local voice recognition module of the earphone can recognize the voice it collects in real time.
  • The second voice collection module is turned on only after the local voice recognition module of the earphone has recognized that the voice collected by the first voice collection module includes a preset keyword and the earphone has detected that the voice transmission link is connected.
  • The first voice collection module is turned off at the same time.
  • Where the keyword is also stored in the earphone, the first voice collection module can remain on while the second voice collection module is turned on.
  • When the earphone detects that the voice transmission link has been closed, it turns off the second voice collection module to save power and at the same time turns on the first voice collection module, so that the next time the user needs the voice recognition function of the terminal, it can again be turned on conveniently by voice, improving the user experience.
  • Where the keyword is also stored in the earphone and the first voice collection module was therefore left on while the second voice collection module was on, there is no need to turn the first voice collection module on again when the earphone turns off the second voice collection module.
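  • The switching between the two voice collection modules can be sketched as follows. The class name, the method names, and the `keep_first_on` flag are hypothetical; the flag stands in for the variant in which the first voice collection module stays on while the second one is on.

```python
class CollectionModules:
    def __init__(self, keep_first_on=False):
        self.keep_first_on = keep_first_on
        self.first_on = True    # feeds the earphone's local keyword recognition
        self.second_on = False  # feeds the voice transmission link

    def on_link_connected(self):
        # Keyword already recognized and link connected: start the second module.
        self.second_on = True
        if not self.keep_first_on:
            self.first_on = False

    def on_link_closed(self):
        # Stop forwarding to save power and resume keyword detection.
        self.second_on = False
        self.first_on = True  # no effect if the first module was never turned off
```

  • With `keep_first_on=False` the two modules never run at the same time, which matches the default behavior described above.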
  • The device for enabling the voice recognition function of the terminal through the earphone provided by this embodiment works while the voice recognition function of the terminal is off and needs no manual operation: once the collection module collects a voice containing the preset keyword, the opening module sends the terminal an open command for turning on the voice recognition function, which turns on the voice recognition function of the terminal to perform voice recognition. The device thus recognizes the keyword at the earphone and automatically turns on the voice recognition function of the terminal, so the terminal does not need to keep the function on for a long time. This not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
  • an embodiment of the present invention further provides an earphone including a memory and a processor, and the processor implements the method provided by any of the foregoing embodiments by calling an instruction stored in the memory.
  • The earphone further includes the components of the earphone body, such as a communication module, a battery, a speaker, an earpiece, and the like.
  • The earphone provided by this embodiment works while the voice recognition function of the terminal is off and needs no manual operation: a voice containing the preset keyword is enough for the earphone to send the terminal an open command for turning on the voice recognition function, so that the terminal automatically turns on its voice recognition function to perform voice recognition. The terminal therefore does not need to keep the voice recognition function on for a long time, which not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
  • FIG. 5 is a flowchart of a method for enabling a voice recognition function of a terminal according to an embodiment of the present invention. As shown in FIG. 5, the method includes:
  • S50 Receive an open command for turning on the voice recognition function sent by the earphone.
  • When the terminal receives the open command for the voice recognition function sent by the earphone, the terminal immediately turns on its own voice recognition function to perform voice recognition.
  • If the terminal receives an open command for the local voice recognition function from the earphone, it turns on its local voice recognition function; if it receives an open command for the network voice recognition function from the earphone, it turns on its network voice recognition function. Preferably, when the terminal receives the open command for the voice recognition function from the earphone, it turns on the local voice recognition function by default to recognize the voice input by the user, and when the local voice recognition function cannot recognize that voice, the terminal turns off the local voice recognition function and turns on the network voice recognition function to recognize it.
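  • The preferred local-first behavior can be sketched as a simple fallback. The two recognizers are hypothetical callables supplied by the caller, and the convention that a recognizer returns `None` when it cannot recognize the voice is an assumption of this sketch, not something the source specifies.

```python
def recognize_with_fallback(audio, local_recognizer, network_recognizer):
    """Try local recognition first; fall back to network recognition.

    Returns a (source, result) pair, where source is "local" or "network".
    """
    result = local_recognizer(audio)
    if result is not None:
        return "local", result
    # Local recognition failed: switch to the network recognizer.
    return "network", network_recognizer(audio)
```

  • The network recognizer is called only when the local one fails, so network recognition is used only when needed.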
  • the method for enabling the voice recognition function of the terminal in the embodiment can be performed in the state that the voice recognition function of the terminal is off, without manual operation, and the terminal can start its own voice recognition only by receiving an open command sent by the earphone.
  • the method can not only reduce the power consumption of the terminal, but also improve the convenience of the user using the voice recognition function, thereby further improving the user experience.
  • When the user is far from the terminal, the voice to be recognized captured by the terminal's own microphone is of poor quality, which lowers the success rate of voice recognition and affects the user.
  • The user could move the terminal closer so that it captures better-quality voice, but because the terminal is not very portable, it is difficult for the user to keep the terminal close when the user's hands are occupied. In view of this, the present invention takes into account that the earphone is more portable, so the terminal can collect higher-quality voice to be recognized through the earphone.
  • Therefore, after the earphone sends the enabling instruction to the terminal, the terminal can also establish a voice transmission link with the earphone and obtain, through that link, the high-quality voice to be recognized collected by the earphone.
  • FIG. 6 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention.
  • To let the terminal obtain, through the voice transmission link, the high-quality voice to be recognized collected by the earphone, as shown in FIG. 6, after the terminal turns on the voice recognition function the method further includes:
  • S60: Initiate a request to the earphone to establish a voice transmission link, so as to establish the voice transmission link.
  • S61: Receive, through the voice transmission link, the voice sent by the earphone, and perform voice recognition.
  • The voice transmission link is a transmission link between the terminal and the earphone for transmitting the voice to be recognized, and the voice mentioned in step S61 is the voice collected by the earphone after the terminal has turned on the voice recognition function.
  • In step S60, when the terminal turns on the voice recognition function and detects that an earphone is connected to it, it sends that earphone a request to establish a voice transmission link between the two. Once the terminal receives the earphone's response to this request, the voice transmission link is established immediately. After the link is established, the terminal receives the voice sent by the earphone through the link and performs voice recognition.
  • Alternatively, the terminal may not send the earphone a request to establish a voice transmission link, and may instead obtain the voice to be recognized input by the user through its own microphone or another voice-capturing device connected to it; that process is not described in detail in the present invention.
  • In step S61, if the user inputs the voice to be recognized at the wrong time, voice recognition may fail. For example, if the user starts speaking after the earphone has sent the enabling instruction but before the voice transmission link between the earphone and the terminal is established, the terminal cannot obtain the whole voice to be recognized once its voice recognition function is on and the link is established, so recognition fails or is incomplete. Therefore, to help the user time the input of the voice to be recognized accurately, and thereby raise the success rate of voice recognition and improve the user experience, as a preferred embodiment the terminal also sends a prompt instruction to the earphone after it establishes the voice transmission link.
  • On receiving the prompt instruction sent by the terminal, the earphone plays a prompt signal indicating that the user can start inputting the voice to be recognized. This helps the user time the input accurately, so that the terminal can obtain, through the voice transmission link, all of the voice to be recognized collected by the earphone, which raises the success rate of voice recognition and improves the user experience.
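  • The terminal-side order of operations in steps S60 and S61, including the prompt instruction, can be sketched as below. This is a hedged illustration only: `FakeEarphone`, its methods and `acquire_voice` are hypothetical names, and falling back to the terminal's own microphone when the earphone refuses the link is an assumption that extends the microphone alternative mentioned above.

```python
class FakeEarphone:
    """Hypothetical stand-in for an earphone connected to the terminal."""

    def __init__(self, accepts_link=True):
        self.accepts_link = accepts_link
        self.link_up = False
        self.events = []  # what the earphone did, in order

    def request_voice_link(self):
        # The earphone's response to the terminal's link request (S60).
        self.link_up = self.accepts_link
        self.events.append("link_up" if self.link_up else "link_refused")
        return self.link_up

    def play_prompt(self):
        # Prompt signal telling the user to start speaking.
        self.events.append("prompt")

    def read_voice(self):
        if not self.link_up:
            raise RuntimeError("voice transmission link is not established")
        self.events.append("voice")
        return b"what is the weather"


def acquire_voice(earphone, read_microphone):
    """Return (voice, source): earphone link first, microphone otherwise."""
    if earphone is not None and earphone.request_voice_link():
        earphone.play_prompt()  # prompt only once the link is up
        return earphone.read_voice(), "earphone"  # S61
    return read_microphone(), "microphone"
```

  • Sending the prompt only after the link is up is what prevents the user from speaking into a link that does not yet exist.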
  • The present invention takes full account of the fact that the terminal's voice recognition success rate depends strongly on the distance between the terminal and the user, and that the terminal is far less portable than the earphone. The terminal therefore receives, through the voice transmission link, the user's voice to be recognized as collected by the earphone, which avoids recognition failures caused by the distance between the user and the terminal. The method can thus raise the terminal's voice recognition success rate and improve the user experience.
  • An embodiment of the present invention provides another method for enabling the voice recognition function of the terminal, which further improves on the method of FIG. 6 and is described in detail below with reference to the accompanying drawings.
  • FIG. 7 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention. As shown in FIG. 7, to reduce the power consumption of the terminal itself, as a preferred embodiment, after step S61 of FIG. 6 is performed, the method further includes:
  • S70: When a preset closing condition is reached, disconnect the voice transmission link and turn off the voice recognition function.
  • The preset closing conditions specifically include:
  • the signal strength of the voice stays below a first preset value for a duration that reaches a second preset value; or the signal strength of the voice stays above a third preset value for a duration that reaches a fourth preset value; or a voice recognition result is obtained.
  • The first preset value is smaller than the third preset value.
  • When the signal strength of the voice stays below the first preset value for a duration that reaches the second preset value, the terminal disconnects the voice transmission link and turns off its own voice recognition function. For example, when the first preset value is the lowest signal strength at which the terminal can recognize voice and the second preset value is 2 seconds, if the signal strength of the voice acquired by the terminal stays below that lowest signal strength for 2 seconds, the terminal considers that the voice to be recognized has been fully input, disconnects the voice transmission link and turns off its voice recognition function.
  • When the signal strength of the voice stays above the third preset value for a duration that reaches the fourth preset value, the terminal disconnects the voice transmission link and turns off its own voice recognition function. For example, when the third preset value is the highest signal strength at which the terminal can recognize voice and the fourth preset value is 10 seconds, if the signal strength of the voice acquired by the terminal stays above that highest signal strength for 10 seconds, the terminal considers the voice input to be faulty, disconnects the voice transmission link and turns off its voice recognition function.
  • After the terminal obtains the recognition result, it considers voice recognition complete, disconnects the voice transmission link and turns off the voice recognition function.
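  • The three closing conditions can be checked with a small routine such as the one below. It is a sketch under stated assumptions: `CloseThresholds` and `close_reason` are hypothetical names, the voice is assumed to arrive as time-ordered `(timestamp_seconds, signal_strength)` samples, and the strength values 0.1 and 0.9 in the usage note are invented for illustration. Only the 2-second and 10-second durations come from the examples above.

```python
from dataclasses import dataclass


@dataclass
class CloseThresholds:
    low_level: float     # first preset value
    low_seconds: float   # second preset value
    high_level: float    # third preset value
    high_seconds: float  # fourth preset value

    def __post_init__(self):
        # The first preset value must be smaller than the third.
        if not self.low_level < self.high_level:
            raise ValueError("first preset value must be below the third")


def close_reason(samples, th, result=None):
    """Return why the link should be closed, or None to keep listening."""
    if result is not None:
        return "result"  # a recognition result has been obtained
    low_start = None   # time at which the current too-quiet run began
    high_start = None  # time at which the current too-loud run began
    for t, level in samples:
        if level < th.low_level:
            low_start = t if low_start is None else low_start
        else:
            low_start = None
        if level > th.high_level:
            high_start = t if high_start is None else high_start
        else:
            high_start = None
        if low_start is not None and t - low_start >= th.low_seconds:
            return "silence"
        if high_start is not None and t - high_start >= th.high_seconds:
            return "overload"
    return None
```

  • With `CloseThresholds(0.1, 2.0, 0.9, 10.0)`, a run of samples below 0.1 that spans 2 seconds yields `"silence"`, while brief dips that recover in between leave the link open.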
  • The terminal may send the audio of the recognition result to the earphone so that the earphone plays it.
  • The audio in the recognition result can also be played by other playback devices, such as the terminal's own speaker or an audio system connected to the terminal.
  • The terminal may also show the recognition result on a display screen, which may be the terminal's own screen or a display device connected to the terminal; for example, the recognition result may be shown by a projector connected to the terminal.
  • The method of this embodiment turns off the terminal's voice recognition function in time, which reduces the terminal's power consumption and extends its standby time, further improving the user experience.
  • In addition, playing the audio in the recognition result through the earphone and showing the text, pictures or video in the recognition result on the display screen further enhances the user experience.
  • step S51 is performed, step S70 is continued, and details are not described herein again.
  • The present invention further provides a device for enabling the voice recognition function of a terminal, corresponding to the method above.
  • Because the device embodiments correspond to the method embodiments, for the device embodiments please refer to the description of the method embodiments; details are not repeated here.
  • FIG. 8 is a structural diagram of an apparatus for enabling a voice recognition function of a terminal according to an embodiment of the present invention. As shown in Figure 8, the device includes:
  • The receiving module 80 is configured to receive the enabling instruction, sent by the earphone, for turning on the voice recognition function.
  • The enabling module 81 is configured to turn on the voice recognition function to perform voice recognition.
  • With the device of this embodiment, when the voice recognition function of the terminal is off, no manual operation is needed: once the receiving module receives the enabling instruction sent by the earphone, the enabling module turns on the voice recognition function to perform voice recognition.
  • Compared with the prior art, the device not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
  • An embodiment of the present invention further provides a terminal that includes a memory and a processor; by calling instructions stored in the memory, the processor implements the method for enabling the voice recognition function of the terminal provided by any of the foregoing embodiments.
  • With the terminal of this embodiment, when the voice recognition function of the terminal is off, no manual operation is needed: the terminal turns on its own voice recognition function to perform voice recognition simply by receiving the enabling instruction sent by the earphone.
  • Compared with the prior art, the terminal not only consumes less power but also makes the voice recognition function more convenient to use, further improving the user experience.
  • The method and device for enabling the voice recognition function of a terminal through an earphone, the method and device for enabling the voice recognition function of a terminal, and the earphone and the terminal provided by the embodiments of the present invention are further described in detail below with reference to an application scenario.
  • FIG. 9 is a schematic diagram of an application scenario of a headset and a terminal according to an embodiment of the present invention. As shown in FIG. 9, the application process includes:
  • S92: After receiving the enabling instruction, the terminal turns on the network voice recognition function and establishes a voice transmission link with the earphone.
  • S93: The user speaks the voice to be recognized to the earphone.
  • S94: The earphone collects the voice to be recognized and sends it to the terminal through the voice transmission link.
  • S95: The terminal receives the voice to be recognized through the voice transmission link and sends it to the network voice recognition server through the network voice recognition module.
  • S96: The network voice recognition server receives the voice to be recognized, recognizes it, obtains the recognition result, and sends the recognition result to the terminal.
  • S97: The terminal receives the recognition result and sends the audio of the recognition result to the earphone.
  • S98: The earphone receives the audio of the recognition result and plays it to the user.
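  • The message flow of this scenario can be traced with a short simulation. It is an illustrative sketch only: `run_scenario`, the tuple-based trace format and the message labels are hypothetical, the wake utterance is assumed to be available as text, and the server-side recognizer is passed in as a plain function.

```python
def run_scenario(wake_text, query_voice, keyword, server_recognize):
    """Return (trace, result); trace lists (sender, receiver, message) tuples."""
    trace = []
    if keyword not in wake_text:
        return trace, None  # no keyword: the terminal is never woken
    # Earphone detected the keyword and wakes the terminal.
    trace.append(("earphone", "terminal", "enabling instruction"))
    # Terminal turns on network recognition and sets up the voice link.
    trace.append(("terminal", "earphone", "voice link request"))
    # User speaks; the earphone forwards the voice over the link.
    trace.append(("earphone", "terminal", "voice to be recognized"))
    # Terminal forwards the voice to the network recognition server.
    trace.append(("terminal", "server", "voice to be recognized"))
    result = server_recognize(query_voice)
    trace.append(("server", "terminal", "recognition result"))
    # Terminal returns the result audio for playback in the earphone.
    trace.append(("terminal", "earphone", "result audio"))
    return trace, result
```

  • The trace makes it easy to see that the terminal stays idle, with an empty trace, until the earphone reports the keyword.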
  • From the detailed description of the application scenario above, it can be seen that when the voice recognition function of the terminal is off, no manual operation is required: a voice containing the preset keyword is enough to turn on the voice recognition function of the terminal, which can then be used normally.
  • Therefore, with the methods, devices, earphone and terminal provided by the embodiments of the present invention, the voice recognition function of the terminal can be turned on and used by voice, which saves the terminal's power, makes the voice recognition function more convenient to use, and further improves the user experience.
  • The method and device for enabling the voice recognition function of a terminal through an earphone, the method and device for enabling the voice recognition function of a terminal, and the earphone and the terminal provided by the embodiments of the present invention have been described in detail above.
  • The embodiments in this specification are described progressively; each embodiment focuses on its differences from the others, and for the same or similar parts the embodiments may be referred to one another.

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)

Abstract

Disclosed is a method for enabling a voice recognition function of a terminal by means of an earphone, comprising: collecting voice; and, when it is determined that the voice contains a preset keyword, sending the terminal an enabling instruction for turning on the voice recognition function so that the terminal performs voice recognition. With this method, when the voice recognition function of the terminal is off, no manual operation is needed: a voice containing the preset keyword is enough to send the enabling instruction and turn on the terminal's voice recognition function. This reduces the power consumption of the terminal, makes the voice recognition function more convenient for the user, and improves the user experience. Also disclosed are a device for enabling the voice recognition function of the terminal by means of the earphone, an earphone, a method and a device for enabling the voice recognition function of the terminal, and a terminal, all with the same advantages.

Description

Method, device, earphone and terminal for enabling voice recognition function of terminal
Cross Reference
This application refers to Chinese Patent Application No. 201710757815.8, filed on August 29, 2017 and entitled "Method, device, earphone and terminal for enabling voice recognition function of terminal", which is incorporated into this application by reference in its entirety.
Technical Field
The present invention relates to the field of voice recognition technology, and in particular to a method, device, earphone and terminal for enabling a voice recognition function of a terminal.
Background
With the development of electronic technology and voice recognition technology, more and more terminals, such as mobile phones, have a voice recognition function.
Although such terminals have a voice recognition function, which makes them more convenient to use to some extent, once the function is turned on the terminal continuously and actively captures ambient sound through its microphone in real time, whether or not the user is using the function, and recognizes the captured voice whenever the intensity of the captured sound exceeds a threshold. Keeping the voice recognition function always on therefore consumes a large amount of energy. Given the terminal's own limited energy, most existing terminals with a voice recognition function keep it normally off, and the user can use it only after manually turning it on through a physical button or a virtual button.
A physical button not only takes up space in the terminal and increases its volume, making the terminal less convenient to carry, but is also prone to failure after repeated presses, which affects use and degrades the user experience. Moreover, when the user's hands are occupied, the user cannot manually turn on the terminal's voice recognition function through a physical or virtual button, which further affects use. Of course, to avoid being unable to turn on voice recognition when both hands are occupied, the user may choose to keep the terminal's voice recognition function always on. However, as described above, the terminal's energy is limited, and because this function consumes a large amount of it, the terminal can easily run short of power, which affects use of the terminal and, in turn, use of the function.
Therefore, how to make the voice recognition function more convenient to use while saving the terminal's own power, and thereby further improve the user experience, is a technical problem that those skilled in the art currently need to solve.
Summary of the Invention
An object of the present invention is to provide a method, device, earphone and terminal for enabling a voice recognition function of a terminal, which make the voice recognition function more convenient to use while saving the terminal's own power, and further improve the user experience.
To solve the above technical problem, the present invention provides a method for enabling a voice recognition function of a terminal through an earphone, the method comprising:
collecting voice;
when it is determined that the voice contains a preset keyword, sending to the terminal an enabling instruction for turning on the voice recognition function, so that the terminal performs voice recognition.
Preferably, after the enabling instruction for turning on the voice recognition function is sent to the terminal, the method for enabling a voice recognition function of a terminal through an earphone further comprises:
responding to a request, initiated by the terminal, to establish a voice transmission link;
when it is detected that the voice transmission link is connected, voice can be transmitted to the terminal through the voice transmission link.
Preferably, the method for enabling a voice recognition function of a terminal through an earphone further comprises:
when it is detected that the terminal has closed the voice transmission link, stopping the transmission of voice to the terminal through the voice transmission link.
Preferably, sending the enabling instruction for turning on the voice recognition function to the terminal is specifically:
sending the enabling instruction to the terminal through an instruction channel based on the BLE protocol.
To solve the above technical problem, the present invention further provides a device for enabling a voice recognition function of a terminal through an earphone, the device comprising:
a collection module, configured to collect voice;
an enabling module, configured to, when it is determined that the voice contains a preset keyword, send to the terminal an enabling instruction for turning on the voice recognition function, so that the terminal performs voice recognition.
To solve the above technical problem, the present invention further provides an earphone comprising a memory and a processor, wherein the processor executes any one of the above methods for enabling a voice recognition function of a terminal through an earphone by calling instructions stored in the memory.
To solve the above technical problem, the present invention further provides a method for enabling a voice recognition function of a terminal, the method comprising:
receiving an enabling instruction, sent by an earphone, for turning on the voice recognition function;
turning on the voice recognition function to perform voice recognition.
Preferably, after the voice recognition function is turned on, the method for enabling a voice recognition function of a terminal further comprises:
initiating a request to the earphone to establish a voice transmission link, so as to establish the voice transmission link;
receiving, through the voice transmission link, the voice sent by the earphone, and performing voice recognition.
Preferably, after the voice sent by the earphone is received through the voice transmission link, the method for enabling a voice recognition function of a terminal further comprises:
when a preset closing condition is reached, disconnecting the voice transmission link and turning off the voice recognition function;
the preset closing condition specifically comprises:
the signal strength of the voice stays below a first preset value for a duration that reaches a second preset value;
or the signal strength of the voice stays above a third preset value for a duration that reaches a fourth preset value;
or a voice recognition result is obtained;
wherein the first preset value is smaller than the third preset value.
To solve the above technical problem, the present invention further provides a device for enabling a voice recognition function of a terminal, the device comprising:
a receiving module, configured to receive an enabling instruction, sent by an earphone, for turning on the voice recognition function;
an enabling module, configured to turn on the voice recognition function to perform voice recognition.
To solve the above technical problem, the present invention further provides a terminal comprising a memory and a processor, wherein the processor executes any one of the above methods for enabling a voice recognition function of a terminal by calling instructions stored in the memory.
Compared with the above prior art, the method provided by the present invention for enabling a voice recognition function of a terminal through an earphone comprises collecting voice and, when it is determined that the voice contains a preset keyword, sending to the terminal an enabling instruction for turning on the voice recognition function so that the terminal performs voice recognition. It can be seen that, with this method, when the voice recognition function of the terminal is off, no manual operation is needed: a voice containing the preset keyword is enough to send the terminal an enabling instruction that turns on its voice recognition function for voice recognition. The method provided by the present invention can therefore perform voice recognition through the earphone and automatically turn on the terminal's voice recognition function, so that the terminal does not need to keep the function on for long periods. This not only reduces the terminal's power consumption but also makes the voice recognition function more convenient to use, further improving the user experience. In addition, the present invention further provides a device for enabling a voice recognition function of a terminal through an earphone, and an earphone, with the same effects.
In addition, the present invention further provides a method for enabling a voice recognition function of a terminal, comprising: receiving an enabling instruction, sent by an earphone, for turning on the voice recognition function; and turning on the voice recognition function to perform voice recognition. It can be seen that, with this method, when the voice recognition function of the terminal is off, no manual operation is needed: the terminal turns on its own voice recognition function to perform voice recognition simply by receiving the enabling instruction sent by the earphone. Compared with the prior art, the method not only reduces the terminal's power consumption but also makes the voice recognition function more convenient to use, further improving the user experience. In addition, the present invention further provides a device for enabling a voice recognition function of a terminal, and a terminal, with the same effects.
Brief Description of the Drawings
To describe the embodiments of the present invention more clearly, the drawings needed for the embodiments are briefly introduced below. Obviously, the drawings described below are only some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a flowchart of a method for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention;
FIG. 2 is a flowchart of another method for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention;
FIG. 3 is a flowchart of another method for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention;
FIG. 4 is a structural diagram of a device for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention;
FIG. 5 is a flowchart of a method for enabling a voice recognition function of a terminal according to an embodiment of the present invention;
FIG. 6 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention;
FIG. 7 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention;
FIG. 8 is a structural diagram of a device for enabling a voice recognition function of a terminal according to an embodiment of the present invention;
FIG. 9 is a schematic diagram of an application scenario of an earphone and a terminal according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
The object of the present invention is to provide a voice recognition method that makes the voice recognition function more convenient to use and thereby improves the user experience.
To help those skilled in the art better understand the technical solution of the present invention, the invention is described in further detail below with reference to the drawings and specific embodiments.
It should be noted that the earphone mentioned in the present invention needs to establish a communication connection with the terminal. The communication module used for the connection may be a Bluetooth module or another type of communication module, as long as the communication modules of the earphone and the terminal match. In addition, the terminal may be a mobile phone or another type of electronic product.
FIG. 1 is a flowchart of a method for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention. As shown in FIG. 1, the method includes:
S10: Collect voice.
S11: When it is determined that the voice contains a preset keyword, send to the terminal an enable instruction for enabling the voice recognition function, so that the terminal performs voice recognition.
In a specific implementation, the voice collection module of the earphone collects the voice input by the user in real time, and the local voice recognition module of the earphone then determines whether the collected voice contains a preset keyword. If it does, the earphone sends the enable instruction to the terminal.
It should be noted that the earphone recognizes the preset keyword as follows: the collected voice is converted into text, and the text is compared with the text of the preset keywords in a database. If the text contains text identical to that of a preset keyword, it is determined that the voice collected by the earphone contains the preset keyword; otherwise, it is determined that the voice does not contain the preset keyword.
The preset keyword mentioned in step S11 is a keyword for enabling the voice recognition function of the terminal. It can be understood that the content of the preset keyword is not limited and may include letters, digits, Chinese characters, and so on. The more content the preset keyword contains, the slower the recognition process, but the more effectively accidental triggering is prevented. The keyword can therefore be chosen according to the actual situation, and the present invention does not limit it.
For example, suppose the preset keyword is "enable the voice recognition function". When the user says "I want to enable the voice recognition function" to the earphone (an utterance that contains the preset keyword), the first voice collection module of the earphone collects the utterance, and the local voice recognition module of the earphone recognizes it and determines that it contains the preset keyword. The earphone then sends the enable instruction to the terminal, and the terminal, upon receiving the enable instruction, enables the voice recognition function and performs voice recognition.
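The text-comparison step described above can be illustrated with a minimal Python sketch. It assumes the collected voice has already been converted to text by some speech-to-text component, which is not shown. The function name `contains_preset_keyword` and the keyword list are hypothetical and do not come from the patent.

```python
def contains_preset_keyword(transcript: str, preset_keywords: list) -> bool:
    """Return True if any preset keyword appears verbatim in the transcript."""
    return any(keyword in transcript for keyword in preset_keywords)


# Hypothetical stand-in for the keyword database stored on the earphone.
PRESET_KEYWORDS = ["enable the voice recognition function"]

# The earphone would send the enable instruction only when this is True.
should_send_enable = contains_preset_keyword(
    "I want to enable the voice recognition function", PRESET_KEYWORDS
)
```

A plain substring test mirrors the "identical text" comparison in the description; a real product might well use a different matching strategy.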
Of course, in a specific implementation there may be one preset keyword or several. If there are several, the database contains the text of each of them, and a voice input containing any one of them enables the voice recognition function of the terminal. In addition, in one specific implementation, two preset keywords may be set: one for enabling the local voice recognition function of the terminal and one for enabling the network voice recognition function of the terminal. Correspondingly, the enable instruction carries matching information. For example, setting the header of the data corresponding to the enable instruction to 1 indicates the local voice recognition function of the terminal, and setting it to 0 indicates the network voice recognition function. Specifically, when the earphone receives voice containing the keyword for enabling the local voice recognition function, it sends the terminal an enable instruction for the local voice recognition function; when the earphone receives voice containing the keyword for enabling the network voice recognition function, it sends the terminal an enable instruction for the network voice recognition function.
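The two-keyword variant can be sketched as follows. The description only states that the header of the instruction data is 1 for local recognition and 0 for network recognition. The one-byte layout, the keyword texts, and the names `RecognitionMode` and `build_enable_instruction` are assumptions made for illustration.

```python
from enum import IntEnum
from typing import Optional


class RecognitionMode(IntEnum):
    NETWORK = 0  # header value 0: network voice recognition
    LOCAL = 1    # header value 1: local voice recognition


# Hypothetical keyword texts; the patent does not specify them.
KEYWORD_TO_MODE = {
    "enable local voice recognition": RecognitionMode.LOCAL,
    "enable network voice recognition": RecognitionMode.NETWORK,
}


def build_enable_instruction(transcript: str) -> Optional[bytes]:
    """Return a one-byte enable instruction, or None if no keyword matched."""
    for keyword, mode in KEYWORD_TO_MODE.items():
        if keyword in transcript:
            # Assumed layout: the instruction is just its header byte.
            return bytes([int(mode)])
    return None
```

For instance, `build_enable_instruction("please enable local voice recognition")` yields the single byte `0x01`, which the terminal would read as a request for local recognition.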
It can be seen that, with the method provided by this embodiment, when the voice recognition function of the terminal is off, no manual operation is needed: a voice containing the preset keyword is enough to send the enable instruction to the terminal and enable its voice recognition function. The method therefore not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
In a specific implementation, when the user speaks the voice to be recognized far from the terminal, the voice captured by the terminal's own microphone is of poor quality, which lowers the success rate of voice recognition and harms the user experience. The user could move the terminal closer to obtain better-quality voice, but the terminal is not very portable, and when the user's hands are occupied it is difficult to keep the terminal close. The earphone, by contrast, is more portable: as long as the user is wearing it, the terminal can obtain high-quality voice through the earphone even when the user speaks far from the terminal. Therefore, after the earphone sends the enable instruction to the terminal, the terminal may also establish a voice transmission link with the earphone, so that it can obtain, over that link, the high-quality voice to be recognized collected by the earphone.
FIG. 2 is a flowchart of another method for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention. In this embodiment, the earphone transmits the high-quality voice to be recognized that it collects to the terminal over the voice transmission link. As shown in FIG. 2, as a preferred implementation, on the basis of FIG. 1, the method further includes, after step S11:
S20: Respond to the request initiated by the terminal to establish a voice transmission link.
S21: When it is detected that the voice transmission link is connected, transmit voice to the terminal over the voice transmission link.
It should be noted that the voice transmission link is the link between the terminal and the earphone over which the voice to be recognized is transmitted. The voice mentioned in step S21 is collected by the earphone after it has sent the enable instruction and has detected that the voice transmission link is connected; it is captured at a different time from the voice mentioned in step S10. Specifically, the voice in step S10 is spoken by the user before the voice transmission link is connected, and the voice in step S21 is spoken after the link is connected. For example, if at the current moment the earphone has not detected that the voice transmission link is connected, the voice it collects in real time is the voice of step S10. If at the next moment the earphone detects that the link is connected, the voice it collects in real time is the voice of step S21.
In a specific implementation, when the earphone receives the terminal's request to establish a voice transmission link, it responds immediately so that the terminal can establish the link as soon as possible. Once the earphone detects that the voice transmission link between itself and the terminal is connected, it begins transmitting the voice it collects in real time to the terminal over the link.
However, it is worth noting that, for step S21, voice recognition is likely to fail if the user speaks the voice to be recognized at the wrong time. For example, if the user speaks the voice to be recognized immediately after speaking the voice containing the preset keyword, the terminal may not yet have fully enabled the voice recognition function or established the voice transmission link. The terminal then cannot obtain what the user said while it was enabling the voice recognition function and establishing the link, and voice recognition fails. Therefore, to help the user time the input of the voice to be recognized, and thereby improve the success rate of voice recognition and the user experience, the following approach may be adopted in other embodiments.
As a preferred implementation, when the earphone detects that the voice transmission link is connected and receives a prompt instruction sent by the terminal, it plays a prompt signal telling the user to start speaking the voice to be recognized, which helps the user time the input accurately. The earphone then transmits the voice it collects to the terminal over the voice transmission link, so that the terminal obtains the complete voice to be recognized, improving the success rate of voice recognition and the user experience.
It can be seen that, in the method provided by this embodiment, the more portable earphone responds to the terminal's request to establish a voice transmission link and, once the link is detected to be connected, transmits the voice it collects to the terminal over the link in real time. The terminal thus obtains high-quality voice to be recognized through the earphone, which avoids the case where the terminal cannot recognize the user's voice because its own microphone cannot capture it with sufficient quality. The method therefore improves the success rate of voice recognition and the user experience.
In a specific implementation, after the terminal has successfully obtained the voice to be recognized or the recognition result, it closes the voice transmission link. The earphone can then stop transmitting collected voice to the terminal to reduce its own power consumption. Accordingly, an embodiment of the present invention provides another method for enabling a voice recognition function of a terminal through an earphone, which further improves on FIG. 2 and is described in detail below with reference to the accompanying drawings.
FIG. 3 is a flowchart of another method for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention. As shown in FIG. 3, to reduce the power consumption of the earphone, as a preferred implementation, on the basis of FIG. 2, the method further includes, after step S21:
S30: When it is detected that the terminal has closed the voice transmission link, stop transmitting voice to the terminal over the voice transmission link.
It is worth noting that although transmission of voice to the terminal stops in step S30, the earphone continues to collect voice in real time. At this point the collected voice is used to determine whether it contains a preset keyword, so that the user can again enable the voice recognition function of the terminal by voice. It should also be noted that, although the earphone collects voice in real time throughout its operation, the collected voice serves different purposes: within one complete cycle, the voice collected before the enable instruction is sent to the terminal is used to determine whether it contains a preset keyword, whereas the voice collected after the enable instruction is sent is transmitted to the terminal for voice recognition. Therefore, in a specific implementation, the earphone includes two collection modules, a first collection module and a second collection module, which do not operate at the same time.
Moreover, to further improve the method provided by this embodiment and the user experience, after step S30 the earphone may also receive the audio contained in the recognition result sent by the terminal and play it.
Of course, it can be understood that the method provided by the embodiments of the present invention may also be further improved on the basis of FIG. 1, that is, step S30 is performed after step S11; the details are not repeated here.
To reduce the power consumption of the earphone, as a preferred implementation, sending the enable instruction to the terminal is specifically: sending the enable instruction to the terminal over an instruction channel based on the BLE protocol. Besides the BLE protocol, other communication protocols may also be used, which are not described further in this embodiment.
The embodiments of the method for enabling a voice recognition function of a terminal through an earphone have been described in detail above. The present invention also provides a corresponding apparatus for enabling a voice recognition function of a terminal through an earphone. Because the apparatus embodiments correspond to the method embodiments, refer to the description of the method embodiments for details, which are not repeated here.
FIG. 4 is a structural diagram of an apparatus for enabling a voice recognition function of a terminal through an earphone according to an embodiment of the present invention. As shown in FIG. 4, the apparatus includes:
A collection module 40, configured to collect voice.
An enabling module 41, configured to send to the terminal, when it is determined that the voice contains a preset keyword, an enable instruction for enabling the voice recognition function, so that the terminal performs voice recognition.
The collection module 40 includes a first voice collection module and a second voice collection module, both of which collect voice in real time. The voice collected by the first voice collection module is recognized by the local voice recognition module of the earphone, whereas the voice collected by the second voice collection module is transmitted by the earphone to the terminal for the terminal to recognize. Generally, to reduce the power consumption of the earphone, while the earphone is not transmitting voice to the terminal only the first voice collection module is kept on, so that the local voice recognition module can determine in real time whether the collected voice contains a preset keyword, and the second voice collection module is kept off. It should be noted that, in general, the second voice collection module is turned on only after the local voice recognition module has recognized a preset keyword in the voice collected by the first voice collection module and the voice transmission link has been detected to be connected; when the second voice collection module is turned on, the first voice collection module is turned off at the same time. Of course, if the earphone also stores keywords for other instructions, the first voice collection module may be kept on while the second voice collection module is on.
It can also be understood that, when the earphone detects that the voice transmission link has been closed, it turns off the second voice collection module to save power and at the same time turns on the first voice collection module, so that the next time the user needs the voice recognition function of the terminal it can again be enabled conveniently by voice. However, if the earphone also stores keywords for other instructions and therefore kept the first voice collection module on while the second was on, the first voice collection module does not need to be turned on again when the second is turned off.
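The on/off behavior of the two voice collection modules can be summarized in a small state sketch. The class name `EarphoneCapture`, its attribute names, and the `has_other_keywords` flag are hypothetical. The sketch also assumes that `on_link_connected` is only called after a preset keyword has been recognized and the enable instruction has been sent.

```python
class EarphoneCapture:
    """Tracks which of the two voice collection modules is switched on."""

    def __init__(self, has_other_keywords: bool = False):
        # True if the earphone also stores keywords for other instructions,
        # in which case the first module stays on while the second is on.
        self.has_other_keywords = has_other_keywords
        self.first_on = True    # feeds the local keyword recognizer
        self.second_on = False  # feeds the voice transmission link

    def on_link_connected(self) -> None:
        """Voice transmission link detected as connected."""
        self.second_on = True
        self.first_on = self.has_other_keywords

    def on_link_closed(self) -> None:
        """Voice transmission link detected as closed by the terminal."""
        self.second_on = False
        self.first_on = True  # already on if it was never turned off
```

In the default configuration only one module is on at any time, which matches the power-saving intent of the description; with `has_other_keywords=True` both modules are on while the link is up.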
With the apparatus provided by this embodiment, when the voice recognition function of the terminal is off, no manual operation is needed: once the collection module collects a voice containing the preset keyword, the enabling module sends the enable instruction to the terminal to enable its voice recognition function. Because the keyword is recognized on the earphone and the terminal's voice recognition function is enabled automatically, the terminal does not need to keep the voice recognition function on for long periods. This not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
In addition, an embodiment of the present invention further provides an earphone that includes a memory and a processor. The processor implements the method provided by any of the foregoing embodiments by invoking instructions stored in the memory. It should be noted that, besides these components, the earphone also includes an earphone body, for example a communication module, a battery, a speaker, and an earpiece.
It can be seen that, with the earphone provided by this embodiment, when the voice recognition function of the terminal is off, no manual operation is needed: a voice containing the preset keyword is enough for the earphone to send the enable instruction to the terminal, and the terminal then enables its own voice recognition function automatically. The terminal therefore does not need to keep the voice recognition function on for long periods, which not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
FIG. 5 is a flowchart of a method for enabling a voice recognition function of a terminal according to an embodiment of the present invention. As shown in FIG. 5, the method includes:
S50: Receive the enable instruction for enabling the voice recognition function sent by the earphone.
S51: Enable the voice recognition function to perform voice recognition.
It should be noted that when the terminal receives the enable instruction sent by the earphone, it immediately enables its own voice recognition function to perform voice recognition.
It can also be understood that if the terminal receives an enable instruction for the local voice recognition function from the earphone, it enables its local voice recognition function, and if it receives an enable instruction for the network voice recognition function, it enables its network voice recognition function. Preferably, when the terminal receives an enable instruction for the voice recognition function from the earphone, it enables the local voice recognition function by default to recognize the voice input by the user; when the local voice recognition function cannot recognize the voice, the terminal turns it off and at the same time enables the network voice recognition function to recognize the voice.
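The preferred local-first behavior can be sketched as below. The recognizers are passed in as plain callables, and the convention that a recognizer returns `None` when it cannot recognize the voice is an assumption of this sketch, as are the names `Recognizer` and `recognize_with_fallback`.

```python
from typing import Callable, Optional

# A recognizer maps raw audio to text, or None if recognition fails (assumed).
Recognizer = Callable[[bytes], Optional[str]]


def recognize_with_fallback(
    audio: bytes, local: Recognizer, network: Recognizer
) -> Optional[str]:
    """Try local recognition first; switch to network recognition on failure."""
    result = local(audio)
    if result is not None:
        return result
    # Local recognition failed: hand the same audio to network recognition.
    return network(audio)
```

Trying the local recognizer first avoids a network round trip whenever the local result is good enough, which is consistent with the default described above.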
It can be seen that, with the method provided by this embodiment, when the voice recognition function of the terminal is off, no manual operation is needed: the terminal enables its own voice recognition function simply by receiving the enable instruction sent by the earphone. Compared with the prior art, the method not only reduces the power consumption of the terminal but also makes the voice recognition function more convenient to use, further improving the user experience.
In a specific implementation, when the user speaks the voice to be recognized far from the terminal, the voice captured by the terminal's own microphone is of poor quality, which lowers the success rate of voice recognition and harms the user experience. The user could move the terminal closer to obtain better-quality voice, but the terminal is not very portable, and when the user's hands are occupied it is difficult to keep the terminal close. The earphone, by contrast, is more portable: as long as the user is wearing it, the terminal can obtain high-quality voice through the earphone even when the user speaks far from the terminal. Therefore, after the earphone sends the enable instruction to the terminal, the terminal may also establish a voice transmission link with the earphone and obtain, over that link, the high-quality voice to be recognized collected by the earphone.
FIG. 6 is a flowchart of another method for enabling a voice recognition function of a terminal according to an embodiment of the present invention. In this embodiment, the terminal obtains the high-quality voice to be recognized collected by the earphone over the voice transmission link. As shown in FIG. 6, as a preferred implementation, on the basis of FIG. 5, the method further includes, after step S51:
S60: Initiate a request to the earphone to establish a voice transmission link, so as to establish the voice transmission link.
S61: Receive the voice sent by the earphone over the voice transmission link, and perform voice recognition.
It should be noted that the voice transmission link is the link between the terminal and the earphone over which the voice to be recognized is transmitted. The voice mentioned in step S61 is collected by the earphone after the terminal has enabled the voice recognition function and the voice transmission link has been established.
For step S60, after the terminal enables the voice recognition function, if it detects that an earphone is connected to it, it sends that earphone a request to establish a voice transmission link between the two. As soon as the terminal receives the earphone's response to the request, it establishes the link, and once the link is established it receives the voice sent by the earphone over the link and performs voice recognition. Of course, it can be understood that after the voice recognition function of the terminal has been enabled through the earphone, the terminal may also choose not to request a voice transmission link and instead obtain the voice to be recognized through its own microphone or through another connected device capable of capturing voice; the details are not described further in the present invention.
It is also worth noting that, for step S61, voice recognition is likely to fail if the user speaks the voice to be recognized at the wrong time. For example, if the voice transmission link between the earphone and the terminal has not yet been established after the earphone sends the enable instruction, but the user has already started speaking the voice to be recognized, the terminal cannot obtain what the user said while it was enabling the voice recognition function and establishing the link, and voice recognition fails or is incomplete. Therefore, to help the user time the input of the voice to be recognized, and thereby improve the success rate of voice recognition and the user experience, as a preferred implementation the terminal, after establishing the voice transmission link, also sends a prompt instruction to the earphone. On receiving the prompt instruction, the earphone plays a prompt signal telling the user to start speaking the voice to be recognized, which helps the user time the input accurately, and the terminal obtains the voice collected by the earphone over the voice transmission link. The terminal thus obtains the complete voice to be recognized, which improves the success rate of voice recognition and the user experience.
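The ordering that matters here (establish the link, then prompt the user, then receive voice) can be shown with a toy in-memory simulation. Nothing below talks to real hardware or a real Bluetooth stack. `FakeEarphone`, `terminal_session`, and all method names are hypothetical stand-ins introduced only for illustration.

```python
class FakeEarphone:
    """In-memory stand-in for the earphone; records events in order."""

    def __init__(self):
        self.events = []
        self.link_up = False

    def accept_link_request(self) -> bool:
        # Earphone responds to the terminal's request (S20 on the earphone side).
        self.link_up = True
        self.events.append("link_connected")
        return True

    def receive_prompt_instruction(self) -> None:
        # A real earphone would play a prompt signal to the user here.
        self.events.append("prompt_played")

    def send_voice(self) -> bytes:
        self.events.append("voice_sent")
        return b"voice-to-be-recognized"


def terminal_session(earphone: FakeEarphone) -> bytes:
    """Run S60, the prompt instruction, and the receive part of S61 in order."""
    if not earphone.accept_link_request():   # S60: request the link
        raise RuntimeError("voice transmission link not established")
    earphone.receive_prompt_instruction()    # prompt only after the link is up
    return earphone.send_voice()             # S61: receive voice over the link
```

Because the prompt is sent only after the link is confirmed, the user is not invited to speak while audio would still be lost.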
It can be seen that the present invention takes into account that the terminal's recognition success rate depends heavily on the distance between the terminal and the user, and that a terminal is far less portable than an earphone. Because the terminal receives, over the voice transmission link, the user's voice as captured by the earphone, recognition failures caused by the user being far from the terminal are avoided. The method can therefore improve the success rate of the terminal's voice recognition and, in turn, the user experience.
In a specific implementation, once the terminal has successfully obtained the voice to be recognized or the recognition result, it no longer needs to obtain voice over the voice transmission link, and can disconnect the link to reduce its own power consumption. If the terminal has already obtained the recognition result, it can also turn off the voice recognition function at the same time. An embodiment of the present invention therefore provides another method for enabling the voice recognition function of a terminal, a further improvement on the method of FIG. 6, described in detail below with reference to the drawings.
FIG. 7 is a flowchart of another method for enabling the voice recognition function of a terminal according to an embodiment of the present invention. As shown in FIG. 7, to reduce the terminal's power consumption, in a preferred embodiment the method of FIG. 6 further includes, after step S61:
S70: When a preset closing condition is met, disconnect the voice transmission link and turn off the voice recognition function.
The preset closing condition is any one of the following:
the signal strength of the voice is lower than a first preset value for a duration reaching a second preset value; or the signal strength of the voice is higher than a third preset value for a duration reaching a fourth preset value; or a voice recognition result is obtained.
Note that the first preset value is smaller than the third preset value. When the signal strength of the voice obtained by the terminal is lower than the first preset value, and remains below it for a time reaching the second preset value, the terminal disconnects the voice transmission link and turns off its voice recognition function. For example, if the first preset value is the lowest signal strength at which the terminal can recognize voice and the second preset value is 2 seconds, then when the signal strength of the voice obtained by the terminal stays below that lowest strength for 2 seconds, the terminal concludes that the user has finished speaking, disconnects the voice transmission link and turns off its voice recognition function.
When the signal strength of the voice obtained by the terminal is higher than the third preset value, and remains above it for a time reaching the fourth preset value, the terminal likewise disconnects the voice transmission link and turns off its voice recognition function. For example, if the third preset value is the highest signal strength at which the terminal can recognize voice and the fourth preset value is 10 seconds, then when the signal strength of the voice obtained by the terminal stays above that highest strength for 10 seconds, the terminal concludes that the voice input is faulty, disconnects the voice transmission link and turns off its voice recognition function.
Once the terminal has obtained the recognition result, it considers voice recognition complete, disconnects the voice transmission link and turns off its voice recognition function. It will be understood that, as a further refinement of the method of this embodiment, the terminal may, after obtaining the recognition result, send the audio of the recognition result to the earphone for playback. The audio may also be played by other playback devices, such as the terminal's own loudspeaker or a speaker system connected to the terminal.
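The three preset closing conditions of step S70 can be illustrated with a small check over timestamped signal-strength samples. This is a sketch under stated assumptions: the names `CloseThresholds` and `close_reason`, the sample format `(timestamp_in_seconds, level)`, and the returned labels are hypothetical, and the patent does not specify units for signal strength or how it is measured.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Tuple


@dataclass(frozen=True)
class CloseThresholds:
    low_level: float    # first preset value (signal strength)
    low_hold_s: float   # second preset value (duration, seconds)
    high_level: float   # third preset value (signal strength)
    high_hold_s: float  # fourth preset value (duration, seconds)

    def __post_init__(self) -> None:
        # The first preset value must be smaller than the third.
        if not self.low_level < self.high_level:
            raise ValueError("low_level must be smaller than high_level")


def close_reason(
    samples: Iterable[Tuple[float, float]],
    thresholds: CloseThresholds,
    result_ready: bool = False,
) -> Optional[str]:
    """Return why the link should be closed, or None to keep it open."""
    if result_ready:
        return "result"
    low_since: Optional[float] = None
    high_since: Optional[float] = None
    for t, level in samples:
        # Track the start of the current uninterrupted low or high run.
        if level < thresholds.low_level:
            low_since = t if low_since is None else low_since
        else:
            low_since = None
        if level > thresholds.high_level:
            high_since = t if high_since is None else high_since
        else:
            high_since = None
        if low_since is not None and t - low_since >= thresholds.low_hold_s:
            return "silence"
        if high_since is not None and t - high_since >= thresholds.high_hold_s:
            return "overload"
    return None
```

With the example values from the text (2 seconds for the low condition, 10 seconds for the high condition), a run of quiet samples spanning 2 seconds yields `"silence"`, and any sample back inside the normal range resets the corresponding timer.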
Similarly, as a further refinement of the method of this embodiment, when the recognition result obtained by the terminal contains text, pictures or video, the terminal may also show the recognition result on a display. The display may be the terminal's own screen or a display device connected to the terminal; for example, the result may be shown by a projector connected to the terminal.
The method of this embodiment can therefore reduce the terminal's power consumption and extend its standby time by turning off the voice recognition function promptly, further improving the user experience. In addition, the audio in the recognition result can be played through the earphone, and the text, pictures or video in the recognition result can be shown on a display, improving the user experience still further.
It will be understood that the method for enabling the voice recognition function of a terminal can also be improved in the same way on the basis of FIG. 5, that is, step S70 is performed after step S51; this is not described in detail here.
The embodiments of the method for enabling the voice recognition function of a terminal have been described in detail above. The present invention also provides a corresponding apparatus for enabling the voice recognition function of a terminal. Because the apparatus embodiments correspond to the method embodiments, refer to the description of the method embodiments for details of the apparatus; they are not repeated here.
FIG. 8 is a structural diagram of an apparatus for enabling the voice recognition function of a terminal according to an embodiment of the present invention. As shown in FIG. 8, the apparatus includes:
a receiving module 80, configured to receive the enable instruction, sent by the earphone, for enabling the voice recognition function; and
an enabling module 81, configured to enable the voice recognition function so as to perform voice recognition.
It can be seen that, with the apparatus of this embodiment, while the terminal's voice recognition function is off and without any manual operation, the terminal need only receive the enable instruction sent by the earphone through the receiving module for the enabling module to enable the voice recognition function and perform voice recognition. Compared with the prior art, the apparatus both reduces the terminal's power consumption and makes the voice recognition function more convenient to use, further improving the user experience.
In addition, an embodiment of the present invention provides a terminal that includes a memory and a processor, the processor invoking instructions stored in the memory to implement the method for enabling the voice recognition function of a terminal provided by any of the above embodiments. With this terminal, while the voice recognition function is off and without any manual operation, the terminal need only receive the enable instruction sent by the earphone to enable its voice recognition function and perform voice recognition. Compared with the prior art, the terminal both reduces power consumption and makes the voice recognition function more convenient to use, further improving the user experience.
To help those skilled in the art better understand the technical solution of the present invention, the invention is described in further detail below with reference to the drawings, covering the method and apparatus for enabling the voice recognition function of a terminal through an earphone, the method and apparatus for enabling the voice recognition function of a terminal, and the earphone and terminal provided by the embodiments of the present invention.
FIG. 9 is a schematic diagram of an application scenario of the earphone and terminal according to an embodiment of the present invention. As shown in FIG. 9, the application process includes:
S90: When the user wants to use the terminal's voice recognition function, the user speaks to the earphone a phrase containing the preset keyword, for example "turn on the network voice recognition function".
S91: When the earphone detects the preset keyword, it sends an enable instruction to the terminal over the instruction channel.
S92: On receiving the enable instruction, the terminal enables the network voice recognition function and establishes a voice transmission link with the earphone.
S93: The user speaks the voice to be recognized to the earphone.
S94: The earphone captures the voice to be recognized and sends it to the terminal over the voice transmission link.
S95: The terminal receives the voice to be recognized over the voice transmission link and, through its network voice recognition module, sends it to the network voice recognition server.
S96: The network voice recognition server receives the voice to be recognized, recognizes it to obtain a recognition result, and sends the recognition result to the terminal.
S97: The terminal receives the recognition result and sends the audio of the recognition result to the earphone.
S98: The earphone receives the audio of the recognition result and plays it to the user.
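The message flow of steps S90 to S98 can be traced with a toy simulation. Everything here is an assumption made for illustration: utterances are modeled as text strings rather than audio, keyword detection is a plain substring match, and `run_scenario`, `PRESET_KEYWORD` and the `recognize` callback (standing in for the network voice recognition server) are hypothetical names.

```python
from typing import Callable, List

# Assumed preset keyword, taken from the example phrase in step S90.
PRESET_KEYWORD = "turn on the network voice recognition function"


def run_scenario(utterances: List[str], recognize: Callable[[str], str]) -> List[str]:
    """Trace the S90-S98 flow for a sequence of things the user says."""
    log: List[str] = []
    link_up = False
    for said in utterances:
        if not link_up:
            # S90/S91: the earphone only reacts to the preset keyword.
            if PRESET_KEYWORD in said.lower():
                log.append("S91 earphone -> terminal: enable instruction")
                log.append("S92 terminal: recognition on, voice link established")
                link_up = True
            continue
        # S93-S98: the next utterance is the voice to be recognized.
        log.append("S94 earphone -> terminal: voice")
        log.append("S95 terminal -> server: voice")
        result = recognize(said)
        log.append("S96 server -> terminal: result")
        log.append(f"S97/S98 earphone plays: {result}")
        break
    return log
```

Speech before the keyword produces no traffic at all in this trace, which mirrors the power-saving claim that the terminal's recognition function stays off until the earphone sends the enable instruction.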
From the above description of the application process, it can be seen that with the method, apparatus, earphone and terminal provided by the embodiments of the present invention, while the terminal's voice recognition function is off and without any manual operation, the earphone can enable the terminal's voice recognition function simply on hearing a phrase that contains the preset keyword, after which the function can be used normally. The method, apparatus, earphone and terminal therefore allow the terminal's voice recognition function to be enabled and used by voice while saving the terminal's power, making the function more convenient to use and further improving the user experience.
The method and apparatus for enabling the voice recognition function of a terminal through an earphone, the method and apparatus for enabling the voice recognition function of a terminal, and the earphone and terminal provided by the embodiments of the present invention have been described in detail above. The embodiments in this specification are described progressively: each embodiment focuses on its differences from the others, and for the parts that are the same or similar the embodiments may be referred to one another.
It should be noted that those of ordinary skill in the art may make various improvements and modifications to the present invention without departing from its principles, and such improvements and modifications also fall within the scope of protection of the claims of the present invention.
It should also be noted that, in this specification, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between those entities or operations. Moreover, the terms "include", "comprise" and any variants of them are intended to cover non-exclusive inclusion, so that a process, method, article or device that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to that process, method, article or device. Without further limitation, an element introduced by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article or device that includes it.

Claims (11)

  1. A method for enabling a voice recognition function of a terminal through an earphone, characterized by comprising:
    capturing voice;
    when it is determined that the voice contains a preset keyword, sending to the terminal an enable instruction for enabling the voice recognition function, so that the terminal performs voice recognition.
  2. The method according to claim 1, characterized in that, after the sending to the terminal of the enable instruction for enabling the voice recognition function, the method further comprises:
    responding to a request, initiated by the terminal, to establish a voice transmission link;
    when it is detected that the voice transmission link is connected, transmitting voice to the terminal over the voice transmission link.
  3. The method according to claim 1 or 2, characterized by further comprising:
    when it is detected that the terminal has closed the voice transmission link, stopping the transmission of voice to the terminal over the voice transmission link.
  4. The method according to claim 3, characterized in that the sending to the terminal of the enable instruction for enabling the voice recognition function is specifically:
    sending the enable instruction to the terminal over an instruction channel based on the BLE protocol.
  5. An apparatus for enabling a voice recognition function of a terminal through an earphone, characterized by comprising:
    a capture module, configured to capture voice;
    an enabling module, configured to send to the terminal, when it is determined that the voice contains a preset keyword, an enable instruction for enabling the voice recognition function, so that the terminal performs voice recognition.
  6. An earphone, characterized by comprising a memory and a processor, the processor invoking instructions stored in the memory to perform the method according to any one of claims 1-4.
  7. A method for enabling a voice recognition function of a terminal, characterized by comprising:
    receiving an enable instruction, sent by an earphone, for enabling the voice recognition function;
    enabling the voice recognition function so as to perform voice recognition.
  8. The method according to claim 7, characterized in that, after the enabling of the voice recognition function, the method further comprises:
    sending to the earphone a request to establish a voice transmission link, so as to establish the voice transmission link;
    receiving, over the voice transmission link, the voice sent by the earphone, and performing voice recognition.
  9. The method according to claim 7 or 8, characterized in that, after the voice sent by the earphone is received over the voice transmission link, the method further comprises:
    when a preset closing condition is met, disconnecting the voice transmission link and turning off the voice recognition function;
    wherein the preset closing condition specifically comprises:
    the signal strength of the voice being lower than a first preset value for a duration reaching a second preset value;
    or the signal strength of the voice being higher than a third preset value for a duration reaching a fourth preset value;
    or a voice recognition result being obtained;
    wherein the first preset value is smaller than the third preset value.
  10. An apparatus for enabling a voice recognition function of a terminal, characterized by comprising:
    a receiving module, configured to receive an enable instruction, sent by an earphone, for enabling the voice recognition function;
    an enabling module, configured to enable the voice recognition function so as to perform voice recognition.
  11. A terminal, characterized by comprising a memory and a processor, the processor invoking instructions stored in the memory to perform the method according to any one of claims 7-9.
PCT/CN2017/108583 2017-08-29 2017-10-31 Method and device for enabling voice recognition function of terminal, earphone and terminal WO2019041512A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710757815.8A CN107393535A (en) 2017-08-29 2017-08-29 A kind of method, apparatus, earphone and terminal for opening terminal speech identification function
CN201710757815.8 2017-08-29

Publications (1)

Publication Number Publication Date
WO2019041512A1 true WO2019041512A1 (en) 2019-03-07

Family

ID=60346153

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/108583 WO2019041512A1 (en) 2017-08-29 2017-10-31 Method and device for enabling voice recognition function of terminal, earphone and terminal

Country Status (2)

Country Link
CN (1) CN107393535A (en)
WO (1) WO2019041512A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111785277A (en) * 2020-06-29 2020-10-16 北京捷通华声科技股份有限公司 Speech recognition method, speech recognition device, computer-readable storage medium and processor

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210256979A1 (en) * 2018-06-29 2021-08-19 Huawei Technologies Co., Ltd. Voice Control Method, Wearable Device, and Terminal
CN109218899A (en) * 2018-08-29 2019-01-15 出门问问信息科技有限公司 A kind of recognition methods, device and the intelligent sound box of interactive voice scene
CN112581949B (en) * 2019-09-29 2023-09-01 深圳市万普拉斯科技有限公司 Device control method, device, electronic device and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030130852A1 (en) * 2002-01-07 2003-07-10 Kabushiki Kaisha Toshiba Headset with radio communication function for speech processing system using speech recognition
US20070060118A1 (en) * 2005-09-13 2007-03-15 International Business Machines Corporation Centralized voice recognition unit for wireless control of personal mobile electronic devices
CN103558916A (en) * 2013-11-07 2014-02-05 百度在线网络技术(北京)有限公司 Man-machine interaction system, method and device
CN105682008A (en) * 2016-02-29 2016-06-15 宇龙计算机通信科技(深圳)有限公司 Method and device for controlling terminal through earphone
CN105895098A (en) * 2016-06-08 2016-08-24 乐视控股(北京)有限公司 Play control method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000115330A (en) * 1998-09-30 2000-04-21 Nec Corp Portable telephone set and portable audio apparatus connected thereto

Also Published As

Publication number Publication date
CN107393535A (en) 2017-11-24

Similar Documents

Publication Publication Date Title
WO2017181730A1 (en) Bluetooth headset and communication method based on same
CN107277754B (en) Bluetooth connection method and Bluetooth peripheral equipment
WO2019237491A1 (en) Wireless earphone pairing method and device and wireless earphones
US9990921B2 (en) User focus activated voice recognition
TWI489372B (en) Voice control method and mobile terminal apparatus
WO2018201944A1 (en) Apparatus control method and device
US9280539B2 (en) System and method for translating speech, and non-transitory computer readable medium thereof
US11688389B2 (en) Method for processing voice signals and terminal thereof
CN107564523B (en) Earphone answering method and device and earphone
CN107978316A (en) The method and device of control terminal
WO2012055361A1 (en) Processing method for earphones and user equipment
WO2019041512A1 (en) Method and device for enabling voice recognition function of terminal, earphone and terminal
CN108665899A (en) A kind of voice interactive system and voice interactive method
CN105794186A (en) Method, device and electronic device for controlling application program
WO2014183529A1 (en) Mobile terminal talk mode switching method, device and storage medium
US11178280B2 (en) Input during conversational session
US20170110131A1 (en) Terminal control method and device, voice control device and terminal
CN105677290B (en) The control method and client of speech application
CN105912111B (en) The method and speech recognition equipment of end voice dialogue in human-computer interaction
CN110620970A (en) Earphone touch control method and device, wireless earphone and TWS earphone
WO2016082344A1 (en) Voice control method and apparatus, and storage medium
JP7330066B2 (en) Speech recognition device, speech recognition method and its program
CN108432220B (en) Method and terminal for switching call mode
WO2017185699A1 (en) Method for finding terminal, and terminal
CN103795873A (en) Method and device for reminding users of responses according to phone call state

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17923221

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17923221

Country of ref document: EP

Kind code of ref document: A1