CN112306560B - Method and apparatus for waking up an electronic device - Google Patents

Method and apparatus for waking up an electronic device Download PDF

Info

Publication number
CN112306560B
CN112306560B CN202010118496.8A CN202010118496A CN112306560B CN 112306560 B CN112306560 B CN 112306560B CN 202010118496 A CN202010118496 A CN 202010118496A CN 112306560 B CN112306560 B CN 112306560B
Authority
CN
China
Prior art keywords
wake
voice
recognition result
recognized
electronic device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010118496.8A
Other languages
Chinese (zh)
Other versions
CN112306560A (en
Inventor
不公告发明人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing ByteDance Network Technology Co Ltd
Original Assignee
Beijing ByteDance Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing ByteDance Network Technology Co Ltd filed Critical Beijing ByteDance Network Technology Co Ltd
Priority to CN202010118496.8A priority Critical patent/CN112306560B/en
Publication of CN112306560A publication Critical patent/CN112306560A/en
Application granted granted Critical
Publication of CN112306560B publication Critical patent/CN112306560B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/4401Bootstrapping
    • G06F9/4418Suspend and resume; Hibernate and awake
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Security & Cryptography (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Telephone Function (AREA)

Abstract

The embodiment of the application discloses a method and a device for waking up electronic equipment. One embodiment of the method comprises: acquiring a voice to be recognized; recognizing the voice to be recognized to generate a first voice recognition result; determining whether a wakeup word containing a first voice recognition result exists in a preset wakeup word set, wherein the number of characters contained in the wakeup word is larger than that of characters contained in the first voice recognition result; responding to the awakening words containing the first voice recognition result in the preset awakening word set, and generating pre-awakening information; and waking up the electronic equipment based on the pre-wake-up information. According to the embodiment, the wake-up words can be matched before the user speaks the complete wake-up words, so that the wake-up speed of the electronic equipment is increased.

Description

Method and apparatus for waking up an electronic device
Technical Field
The embodiment of the application relates to the technical field of computers, in particular to a method and a device for waking up electronic equipment.
Background
With the rapid development of computer and internet technologies, more and more intelligent electronic devices are widely used. In order to reduce the power consumption of the electronic device, the electronic device usually needs to be awakened before using the corresponding function, so that the electronic device enters the working mode.
The relevant way is usually that the user speaks a wake word in speech form and the electronic device detects the complete wake word spoken by the user to wake up the device.
Disclosure of Invention
The embodiment of the application provides a method and a device for waking up an electronic device.
In a first aspect, an embodiment of the present application provides a method for waking up an electronic device, where the method includes: acquiring a voice to be recognized; recognizing the voice to be recognized to generate a first voice recognition result; determining whether a wakeup word containing a first voice recognition result exists in a preset wakeup word set, wherein the number of characters contained in the wakeup word is larger than that of characters contained in the first voice recognition result; responding to the fact that the awakening words containing the first voice recognition result exist in the preset awakening word set, and generating pre-awakening information; waking up the electronic device based on the pre-wake-up information.
In some embodiments, the determining whether a wakeup word including a character string in the first speech recognition result exists in the preset wakeup word set includes: and responding to the fact that a matched awakening word exists in the preset awakening word set, and determining that the awakening word containing the character string in the first voice recognition result exists in the preset awakening word set, wherein the matched awakening word contains the character string consistent with the first voice recognition result.
In some embodiments, the waking up the electronic device based on the pre-wake-up information includes: in response to the fact that the pre-awakening information is used for representing that pre-awakening is allowed, acquiring subsequent voice associated with the voice to be recognized, wherein the acquisition time of the subsequent voice is later than that of the voice to be recognized; and awakening the electronic equipment based on the matching between the voice to be recognized and the subsequent voice and the preset awakening word set.
In some embodiments, the method further comprises: in response to determining that the pre-wake information is used to characterize that pre-wake is allowed, sending response information characterizing that the electronic device is woken up.
In some embodiments, the waking up the electronic device based on the matching between the to-be-recognized speech and the subsequent speech and the preset wake-up word set includes: performing voice recognition on the subsequent voice to generate a second voice recognition result; splicing the first voice recognition result and the second voice recognition result according to the acquisition time to generate a target voice recognition result; and waking up the electronic equipment in response to determining that the target voice recognition result is consistent with the matched wake-up word.
In a second aspect, an embodiment of the present application provides an apparatus for waking up an electronic device, where the apparatus includes: an acquisition unit configured to acquire a voice to be recognized; the recognition unit is configured to recognize the voice to be recognized and generate a first voice recognition result; a determining unit configured to determine whether a wakeup word including a first voice recognition result exists in a preset wakeup word set, wherein the number of characters included in the wakeup word is greater than the number of characters included in the first voice recognition result; a generating unit configured to generate pre-wakeup information in response to determining that a wakeup word including a first voice recognition result exists in a preset wakeup word set; a wake-up unit configured to wake up the electronic device based on the pre-wake-up information.
In some embodiments, the determining unit is further configured to: and responding to the fact that a matched awakening word exists in the preset awakening word set, and determining that the awakening word containing the character string in the first voice recognition result exists in the preset awakening word set, wherein the matched awakening word contains the character string consistent with the first voice recognition result.
In some embodiments, the wake-up unit comprises: an acquisition module configured to acquire a subsequent voice associated with the voice to be recognized in response to determining that the pre-wake information is used to characterize that pre-wake is allowed, wherein the acquisition time of the subsequent voice is later than that of the voice to be recognized; and the awakening module is configured to awaken the electronic equipment based on the matching between the voice to be recognized and the subsequent voice and the preset awakening word set.
In some embodiments, the apparatus further comprises: a sending unit configured to send response information characterizing that the electronic device is woken up, in response to determining that the pre-wake-up information is used to characterize that pre-wake-up is allowed.
In some embodiments, the wake-up module comprises: the recognition submodule is configured to perform voice recognition on the subsequent voice and generate a second voice recognition result; the splicing submodule is configured to splice the first voice recognition result and the second voice recognition result according to the acquisition time to generate a target voice recognition result; a wake-up sub-module configured to wake up the electronic device in response to determining that the target speech recognition result is consistent with the matching wake-up word.
In a third aspect, an embodiment of the present application provides a terminal, where the terminal includes: one or more processors; a storage device having one or more programs stored thereon; when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the method as described in any implementation of the first aspect.
In a fourth aspect, the present application provides a computer-readable medium, on which a computer program is stored, which when executed by a processor implements the method described in any implementation manner of the first aspect.
According to the method and the device for waking up the electronic equipment, the voice to be recognized is firstly acquired. And then, recognizing the voice to be recognized to generate a first voice recognition result. And then, determining whether a wakeup word containing the first voice recognition result exists in a preset wakeup word set. And the number of the characters included in the awakening word is larger than that of the characters included in the first voice recognition result. And then, responding to the fact that the awakening words containing the first voice recognition result exist in the preset awakening word set, and generating the pre-awakening information. And finally, waking up the electronic equipment based on the pre-wake-up information. Therefore, the user can start to match the awakening words before the user speaks the complete awakening words, and the awakening speed of the electronic equipment is improved. From the perspective of the user, the time for waiting for the electronic equipment to wake up is reduced, and the user experience is improved.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is an exemplary system architecture diagram in which one embodiment of the present application may be applied;
FIG. 2 is a flow diagram of one embodiment of a method for waking up an electronic device according to the present application;
FIG. 3 is a schematic diagram of one application scenario of a method for waking up an electronic device according to an embodiment of the present application;
FIG. 4 is a flow diagram of yet another embodiment of a method for waking an electronic device according to the present application;
FIG. 5 is a block diagram of one embodiment of an apparatus for waking up an electronic device according to the application;
FIG. 6 is a schematic block diagram of an electronic device suitable for use to implement embodiments of the present application.
Detailed Description
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 shows an exemplary architecture 100 to which the method for waking up an electronic device or the apparatus for waking up an electronic device of the present application may be applied.
As shown in fig. 1, system architecture 100 may include terminal device 101, network 102, and server 103. Network 102 is the medium used to provide communication links between terminal devices 101 and server 103. Network 102 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The terminal apparatus 101 interacts with the server 103 through the network 102 to receive or transmit messages and the like. Various communication client applications, such as a web browser application, a shopping application, a search application, an instant messaging tool, a mailbox client, social platform software, etc., may be installed on the terminal device 101.
The terminal apparatus 101 may be hardware or software. When the terminal device 101 is hardware, it may be various electronic devices that have a camera and support communication, including but not limited to a smart phone, a tablet computer, a smart player device, a laptop portable computer, a desktop computer, and the like. When the terminal apparatus 101 is software, it can be installed in the electronic apparatuses listed above. It may be implemented as multiple pieces of software or software modules (e.g., software or software modules used to provide distributed services) or as a single piece of software or software module. And is not particularly limited herein.
The server 103 may be a server providing various services, such as a background server providing support for wake-up of the terminal device 101. The background server can analyze and process the received voice to be recognized, and feed back the processing result (whether the processing result contains the matched awakening word) to the terminal equipment.
The server may be hardware or software. When the server is hardware, it may be implemented as a distributed server cluster formed by multiple servers, or may be implemented as a single server. When the server is software, it may be implemented as multiple pieces of software or software modules (e.g., software or software modules for providing distributed services) or as a single piece of software or software module. And is not particularly limited herein.
It should be noted that the method for waking up the electronic device provided by the embodiment of the present disclosure is generally performed by the terminal device 101, and accordingly, the apparatus for waking up the electronic device is generally disposed in the terminal device 101. At this time, the network 102 and the server 103 may not exist. It should be further noted that the method for waking up the electronic device may also be performed by the server 103, and accordingly, the apparatus for waking up the electronic device may be disposed in the server 103.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
With continued reference to FIG. 2, a flow 200 of one embodiment of a method for waking up an electronic device in accordance with the present application is shown. The method for waking up the electronic device comprises the following steps:
step 201, obtaining the voice to be recognized.
In this embodiment, an execution subject of the method for waking up the electronic device (such as the terminal device 101 shown in fig. 1) may acquire the voice to be recognized in a wired connection manner or a wireless connection manner. The speech to be recognized may include a speech of a target user. The target user may be any user, or may be a user determined according to a rule, for example, a user having a wake-up authority for an electronic device determined according to a Voiceprint Recognition (VPR) technology. Optionally, the speech to be recognized may further include speech of which the length of the speech segment is smaller than a preset threshold.
Specifically, the executing body may acquire the voice to be recognized, which is pre-stored locally, or may acquire the voice to be recognized, which is transmitted by a voice input device (e.g., a microphone) communicatively connected to the executing body.
It should be noted that the speech may include, but is not limited to, a complete speech segment with a definite beginning and ending. The speech may also be a speech segment in a continuous stream of speech.
Step 202, recognizing the voice to be recognized, and generating a first voice recognition result.
In this embodiment, the executing entity may perform recognition on the speech to be recognized acquired in step 201 by using various speech recognition techniques to generate a first speech recognition result. The voice recognition technology can comprise a traditional voice recognition method and an end-to-end acoustic model training method combined with an artificial neural network. The above conventional speech recognition methods generally include feature extraction, acoustic model matching, language models, and the like. The end-to-end acoustic model training method may include generating a speech recognition result using a model constructed based on a DTW (Dynamic Time Warping) algorithm, a CTC (connection Temporal Classification) algorithm, and the like.
In some optional implementation manners of this embodiment, the executing body may further perform word segmentation on a result of recognizing the speech to be recognized according to a preset number of characters, and generate at least one word with a number of characters not exceeding the preset number of characters as the first speech recognition result.
Step 203, determining whether a preset wake-up word set contains a wake-up word containing a first voice recognition result.
In this embodiment, the execution main body may determine whether a wakeup word including the first speech recognition result exists in a preset wakeup word set in various ways. Wherein the number of characters included in the wake-up word is generally greater than the number of characters included in the first speech recognition result. As an example, the execution main body may use the first speech recognition result as a search term, and determine whether a search result including the search term exists in the preset wake-up term set.
In some optional implementation manners of this embodiment, in response to determining that a matching wakeup word exists in the preset wakeup word set, the execution main body may determine that a wakeup word including a character string in the first speech recognition result exists in the preset wakeup word set. And the matched awakening words comprise character strings consistent with the first voice recognition result. For example, the first N characters of the matching wake-up word are consistent with the first speech recognition result. As an example, the first speech recognition result may be "small K" or "small K small". As yet another example, the first speech recognition result may also be "XX" or "XX-the same". Then, the matching wake words corresponding to the above-described first speech recognition result may be "small K" and "XX classmates", respectively.
Step 204, in response to determining that the preset wake-up word set contains a wake-up word including the first speech recognition result, generating pre-wake-up information.
In this embodiment, in response to determining that there is a search result including the search term, the execution subject may generate pre-wake information indicating that pre-wake is allowed. In response to determining that there is no search result including the search term, the execution subject may generate pre-wake-up information characterizing that pre-wake-up is not allowed.
And step 205, waking up the electronic device based on the pre-wake-up information.
In this embodiment, based on the pre-wake-up information generated in the step 203, the executing entity may wake up the electronic device in various ways. As an example, the execution subject may wake up the electronic device directly in response to determining that the pre-wake-up information is used to characterize that pre-wake-up is allowed. It should be noted that the awakened electronic device may be the execution main body itself.
With continued reference to fig. 3, fig. 3 is a schematic diagram of an application scenario of a method for waking up an electronic device according to an embodiment of the present application. In the application scenario of fig. 3, a user 301 is ready to wake up a smart terminal device (e.g. a smart sound box) 302. Thus, the user 301 speaks the wake word by word towards the smart terminal device 302. The intelligent terminal device 302 acquires the voice 303 of the user 301 speaking the awakening word by word. Then, the intelligent terminal device 302 performs speech recognition on the speech 303 to generate a first speech recognition result "small K small" 304. According to the fact that the wake-up word "small K" including the first speech recognition result 304 exists in the preset wake-up word set, the smart terminal device 302 may generate the pre-wake-up information 305 indicating that pre-wake-up is allowed. Then, according to the above pre-wake-up information 305 for representing that pre-wake-up is allowed, the smart terminal device 302 can be directly woken up, i.e. switched to the operating mode.
At present, in one of the prior arts, after a user completely speaks a wake word, the electronic device matches the wake word again, which results in a low wake speed of the electronic device. In the method provided by the embodiment of the application, the first voice recognition result containing the characters with the number smaller than that of the characters contained in the awakening word is generated through the voice to be recognized, the pre-awakening information is further generated, and then the electronic equipment is awakened according to the pre-awakening information. Therefore, the user can start to match the awakening words before the user speaks the complete awakening words, and the awakening speed of the electronic equipment is improved. From the perspective of the user, the time for waiting for the electronic equipment to wake up is reduced, and the user experience is improved.
With further reference to fig. 4, a flow 400 of yet another embodiment of a method for waking an electronic device is shown. The flow 400 of the method for waking up an electronic device includes the following steps:
step 401, obtaining a voice to be recognized.
Step 402, recognizing the voice to be recognized, and generating a first voice recognition result.
Step 403, in response to determining that there is a matching wakeup word in the preset wakeup word set, determining that there is a wakeup word containing the character string in the first speech recognition result in the preset wakeup word set.
Step 404, in response to determining that a wakeup word including the first speech recognition result exists in the preset wakeup word set, generating pre-wakeup information.
The above steps 401, 402, and 404 are respectively consistent with the steps 201, 202, 204, and their optional implementations in the foregoing embodiment, the above step 403 is consistent with the corresponding optional implementation in the foregoing embodiment, and the above descriptions for the steps 201, 202, 204, their optional implementations, and the optional implementation of the step 203 also apply to the steps 401, 402, 403, and 404, which are not described again here.
In some optional implementations, in response to determining that the pre-wake-up information is used to characterize that pre-wake-up is allowed, the execution body may further send response information characterizing that the electronic device is woken up. Based on the optional implementation manner, the electronic device may send response information of the wake-up device to the user when acquiring the corresponding pre-wake-up information. Compared with the method that the user waits for the user to speak the complete awakening word and then matches the awakening word, the response information is sent to the user, the time for the user to wait for the awakening response is shortened, the awakening rate of the electronic equipment is improved, and therefore the user experience is improved.
In response to determining that the pre-wake information is used to characterize the pre-wake allowed, a subsequent voice associated with the voice to be recognized is obtained, step 405.
In this embodiment, an executing entity (e.g., the server 105 shown in fig. 1) of the method for waking up the electronic device may retrieve subsequent speech associated with the speech to be recognized in response to determining that the pre-wake information is used to characterize the pre-wake allowed. Wherein, the acquisition time of the subsequent voice is generally later than the voice to be recognized.
It should be noted that, in the context of a continuous speech stream, the first frame of the subsequent speech may be generally consecutive in time with the last frame of the speech to be recognized.
In response to determining that the pre-wake-up information is used to characterize that pre-wake-up is allowed, the execution subject may obtain a pre-stored subsequent voice from a local location, as an example; as still another example, the execution subject may acquire the subsequent voice from the same device (e.g., a microphone) as the voice to be recognized.
Step 406, waking up the electronic device based on the matching between the voice to be recognized and the subsequent voice and the preset wake-up word set.
In this embodiment, based on matching between the to-be-recognized speech and the subsequent speech and the preset wake-up word set, the execution main body may wake up the electronic device in various ways. As an example, the execution subject may splice the speech to be recognized and the subsequent speech into a target speech according to the collection time. Then, the execution subject may perform speech recognition on the target speech to generate a target speech recognition result. Next, the executing body may determine whether the target speech recognition result includes a wakeup word in the preset wakeup word set. In response to determining to include, the execution body may be awakened.
In some optional implementation manners of this embodiment, the executing main body may further wake up the electronic device according to the following steps:
the first step is to perform voice recognition on the subsequent voice to generate a second voice recognition result.
In these implementations, the execution subject may generate the second speech recognition result in a manner similar to the generation of the first speech recognition result.
And secondly, splicing the first voice recognition result and the second voice recognition result according to the acquisition time to generate a target voice recognition result.
In these implementations, the executing body may splice the first speech recognition result obtained in the step 402 and the second speech recognition result obtained in the first step according to the speech acquisition time, so as to generate a target speech recognition result. As an example, the first speech recognition result may be "XX" and the second speech recognition result may be "classmate". Then, the execution agent may generate a target speech recognition result "XX classmate".
And thirdly, responding to the fact that the target voice recognition result is consistent with the matched awakening words, and awakening the electronic equipment.
In these implementations, the executing entity may wake the electronic device in response to determining that the target speech recognition result is consistent with the matching wake word.
As can be seen from fig. 4, the flow 400 of the method for waking up the electronic device in the present embodiment embodies the steps of acquiring the subsequent voice and waking up the electronic device according to the voice recognition result of the subsequent voice. Therefore, according to the scheme described in this embodiment, on the basis of waking up the electronic device according to the pre-wake-up information, the matching step of the complete wake-up word is added, so that the technical effect of giving consideration to both the wake-up rate and the wake-up accuracy is achieved.
With further reference to fig. 5, as an implementation of the methods shown in the above-mentioned figures, the present application provides an embodiment of an apparatus for waking up an electronic device, where the embodiment of the apparatus corresponds to the embodiment of the method shown in fig. 2, and the apparatus may be specifically applied to various electronic devices.
As shown in fig. 5, the apparatus 500 for waking up an electronic device provided in this embodiment includes an obtaining unit 501, an identifying unit 502, a determining unit 503, a generating unit 504, and a waking unit 505. The acquiring unit 501 is configured to acquire a voice to be recognized; a recognition unit 502 configured to recognize a speech to be recognized, and generate a first speech recognition result; a determining unit 503 configured to determine whether a wakeup word including a first speech recognition result exists in a preset wakeup word set, wherein the number of characters included in the wakeup word is greater than the number of characters included in the first speech recognition result; a generating unit 504 configured to generate pre-wakeup information in response to determining that a wakeup word including the first speech recognition result exists in a preset wakeup word set; a wake-up unit 505 configured to wake up the electronic device based on the pre-wake-up information.
In this embodiment, in the apparatus 500 for waking up an electronic device: the specific processing of the obtaining unit 501, the identifying unit 502, the determining unit 503, the generating unit 504, and the waking unit 505 and the technical effects thereof can refer to the related descriptions of step 201, step 202, step 203, step 204, and step 205 in the corresponding embodiment of fig. 2, respectively, and are not described herein again.
In some optional implementations of the embodiment, the determining unit 503 may be further configured to determine that a wake word including the character string in the first speech recognition result exists in the preset wake word set in response to determining that a matching wake word exists in the preset wake word set. The matched awakening word can contain a character string consistent with the first voice recognition result.
In some optional implementations of the present embodiment, the wake-up unit 505 may include an obtaining module (not shown in the figure) and a wake-up module (not shown in the figure). Wherein the obtaining module may be configured to obtain subsequent speech associated with the speech to be recognized in response to determining that the pre-wake information is used to characterize the pre-wake allowed. Wherein, the acquisition time of the subsequent voice can be later than the voice to be recognized. The wake-up module may be configured to wake up the electronic device based on a match between the speech to be recognized and the subsequent speech and a preset set of wake-up words.
In some optional implementations of the present embodiment, the apparatus 500 for waking up an electronic device may further include a sending unit (not shown in the figure). Wherein the transmitting unit may be configured to transmit response information indicating that the electronic device is woken up, in response to determining that the pre-wake-up information is used to indicate that pre-wake-up is allowed.
In some optional implementations of this embodiment, the wake-up module may include: an identification submodule (not shown), a splicing submodule (not shown), and a wake-up submodule (not shown). The recognition submodule can be configured to perform speech recognition on the subsequent speech to generate a second speech recognition result. The splicing submodule can be configured to splice the first speech recognition result and the second speech recognition result according to the acquisition time to generate a target speech recognition result. The wake-up sub-module may be configured to wake up the electronic device in response to determining that the target speech recognition result is consistent with the matching wake-up word.
The apparatus provided in the above embodiment of the present application first acquires the speech to be recognized through the acquisition unit 501. Then, the recognition unit 502 recognizes the speech to be recognized, and generates a first speech recognition result. Next, the determining unit 503 determines whether a wakeup word including the first speech recognition result exists in a preset wakeup word set, wherein the number of characters included in the wakeup word is greater than the number of characters included in the first speech recognition result. Then, the generating unit 504 generates pre-wakeup information in response to determining that a wakeup word including the first speech recognition result exists in the preset wakeup word set. The wake-up unit 504 wakes up the electronic device based on the pre-wake-up information. Therefore, the user can start to match the awakening words before the user speaks the complete awakening words, and the awakening speed of the electronic equipment is improved. From the perspective of the user, the time for waiting for the electronic equipment to wake up is reduced, and the user experience is improved.
Referring now to fig. 6, shown is a schematic diagram of an electronic device (e.g., terminal device 101 of fig. 1) 600 suitable for implementing embodiments of the present application. The terminal device in the embodiments of the present application may include, but is not limited to, a mobile terminal such as a mobile phone, a notebook computer, a PDA (personal digital assistant), a PAD (tablet), a smart speaker, and the like. The terminal device shown in fig. 6 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 6, the electronic apparatus 600 may include a Central Processing Unit (CPU) 601, a memory 602, an input unit 603, and an output unit 604, wherein the central processing unit 601, the memory 602, the input unit 603, and the output unit 604 are connected to each other through a bus 605. Here, the method according to an embodiment of the present application may be implemented as a computer program and stored in the memory 602. The central processor 601 in the electronic device 600 specifically implements the electronic device wake-up function defined in the method of the embodiment of the present application by calling the above-mentioned computer program stored in the memory 602. In some implementations, the input unit 603 can include a voice input device (e.g., a microphone). The output unit 604 may be a device such as a speaker or a display screen that can be used to broadcast voice or display images. Thus, the central processing unit 601, when calling the above-mentioned computer program to execute the text recognition function, can control the input unit 603 to acquire the voice to be recognized from the outside and control the output unit 604 to respond to the wake-up voice.
It should be noted that the computer readable medium described in the embodiments of the present application may be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In embodiments of the application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In embodiments of the present application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: electrical wires, optical cables, RF (Radio Frequency), etc., or any suitable combination of the foregoing.
The computer readable medium may be embodied in the electronic device; or may exist separately without being assembled into the terminal device. The computer readable medium carries one or more programs which, when executed by the terminal device, cause the terminal device to: acquiring a voice to be recognized; recognizing the voice to be recognized to generate a first voice recognition result; determining whether a wakeup word containing a first voice recognition result exists in a preset wakeup word set, wherein the number of characters contained in the wakeup word is larger than that of characters contained in the first voice recognition result; responding to the fact that the awakening words containing the first voice recognition result exist in the preset awakening word set, and generating pre-awakening information; waking up the electronic device based on the pre-wake-up information.
Computer program code for carrying out operations for embodiments of the present application may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, smalltalk, C + +, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present application may be implemented by software or hardware. The described units may also be provided in a processor, and may be described as: a processor comprises an acquisition unit, an identification unit, a determination unit, a generation unit and a wake-up unit. Where the names of these units do not in some cases constitute a limitation on the unit itself, for example, the acquiring unit may also be described as "a unit that acquires speech to be recognized".
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention in the embodiments of the present application is not limited to the specific combination of the above-mentioned features, but also encompasses other embodiments in which any combination of the above-mentioned features or their equivalents is made without departing from the inventive concept as defined above. For example, the above features and (but not limited to) the features with similar functions disclosed in the embodiments of the present application are mutually replaced to form the technical solution.

Claims (8)

1. A method for waking an electronic device, comprising:
acquiring a voice to be recognized;
recognizing the voice to be recognized to generate a first voice recognition result;
determining whether a matched awakening word containing the first voice recognition result exists in a preset awakening word set, wherein the number of characters included in the matched awakening word is larger than the number of characters included in the first voice recognition result; the matched awakening words comprise character strings consistent with the first voice recognition result;
responding to the matched awakening words containing the first voice recognition result in the preset awakening word set, and generating pre-awakening information;
waking up the electronic device based on the pre-wake-up information;
the waking up the electronic device based on the pre-wake-up information includes:
in response to determining that the pre-awakening information is used for representing that pre-awakening is allowed, acquiring subsequent voice associated with the voice to be recognized, wherein the acquisition time of the subsequent voice is later than that of the voice to be recognized; the first frame of the subsequent voice is continuous in time with the last frame of the voice to be recognized;
and awakening the electronic equipment based on the matching between the voice to be recognized and the subsequent voice and the preset awakening word set.
2. The method of claim 1, wherein the method further comprises:
in response to determining that the pre-wake-up information is indicative of pre-wake-up being allowed, sending response information indicative of the electronic device being woken up.
3. The method according to claim 1 or 2, wherein waking up the electronic device based on the matching between the speech to be recognized and the subsequent speech and the preset set of wake-up words comprises:
performing voice recognition on the subsequent voice to generate a second voice recognition result;
splicing the first voice recognition result and the second voice recognition result according to the acquisition time to generate a target voice recognition result;
and in response to determining that the target speech recognition result is consistent with the matching wake-up word, waking up the electronic device.
4. An apparatus for waking an electronic device, comprising:
an acquisition unit configured to acquire a voice to be recognized;
the recognition unit is configured to recognize the voice to be recognized and generate a first voice recognition result;
a determining unit configured to determine whether a matching wake-up word including the first speech recognition result exists in a preset wake-up word set, wherein the number of characters included in the matching wake-up word is greater than the number of characters included in the first speech recognition result; the matched awakening words comprise character strings consistent with the first voice recognition result;
a generating unit configured to generate pre-wakeup information in response to determining that the matching wakeup word including the first voice recognition result exists in the preset wakeup word set;
a wake-up unit configured to wake up the electronic device based on the pre-wake-up information;
the wake-up unit includes:
an obtaining module configured to obtain a subsequent voice associated with the voice to be recognized in response to determining that the pre-wake information is used to characterize that pre-wake is allowed, wherein the subsequent voice is collected later than the voice to be recognized; the first frame of the subsequent voice is continuous in time with the last frame of the voice to be recognized;
a wake-up module configured to wake up the electronic device based on a match between the voice to be recognized and the subsequent voice and the preset set of wake-up words.
5. The apparatus of claim 4, wherein the apparatus further comprises:
a sending unit configured to send response information characterizing that the electronic device is woken up in response to determining that the pre-wake-up information is used to characterize that pre-wake-up is allowed.
6. The apparatus of claim 4 or 5, wherein the wake-up module comprises:
the recognition submodule is configured to perform voice recognition on the subsequent voice and generate a second voice recognition result;
the splicing submodule is configured to splice the first voice recognition result and the second voice recognition result according to the acquisition time to generate a target voice recognition result;
a wake-up sub-module configured to wake up the electronic device in response to determining that the target speech recognition result is consistent with the matching wake-up word.
7. A terminal, comprising:
one or more processors;
a storage device having one or more programs stored thereon;
the one or more programs, when executed by the one or more processors, cause the one or more processors to implement the method recited in any of claims 1-3.
8. A computer-readable medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1-3.
CN202010118496.8A 2020-02-26 2020-02-26 Method and apparatus for waking up an electronic device Active CN112306560B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010118496.8A CN112306560B (en) 2020-02-26 2020-02-26 Method and apparatus for waking up an electronic device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010118496.8A CN112306560B (en) 2020-02-26 2020-02-26 Method and apparatus for waking up an electronic device

Publications (2)

Publication Number Publication Date
CN112306560A CN112306560A (en) 2021-02-02
CN112306560B true CN112306560B (en) 2022-12-13

Family

ID=74336654

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010118496.8A Active CN112306560B (en) 2020-02-26 2020-02-26 Method and apparatus for waking up an electronic device

Country Status (1)

Country Link
CN (1) CN112306560B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117409779B (en) * 2023-12-14 2024-02-23 四川蜀天信息技术有限公司 Voice wakeup method, device, system and readable medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110225386B (en) * 2019-05-09 2021-09-14 海信视像科技股份有限公司 Display control method and display device

Also Published As

Publication number Publication date
CN112306560A (en) 2021-02-02

Similar Documents

Publication Publication Date Title
CN110047481B (en) Method and apparatus for speech recognition
CN107863108B (en) Information output method and device
US11189262B2 (en) Method and apparatus for generating model
CN108665895B (en) Method, device and system for processing information
US20240021202A1 (en) Method and apparatus for recognizing voice, electronic device and medium
JP2021196599A (en) Method and apparatus for outputting information
CN112259089A (en) Voice recognition method and device
CN111312233A (en) Voice data identification method, device and system
CN111625649A (en) Text processing method and device, electronic equipment and medium
CN112364144A (en) Interaction method, device, equipment and computer readable medium
CN110138654B (en) Method and apparatus for processing speech
CN111326146A (en) Method and device for acquiring voice awakening template, electronic equipment and computer readable storage medium
CN108962226B (en) Method and apparatus for detecting end point of voice
CN112306560B (en) Method and apparatus for waking up an electronic device
CN114242065A (en) Voice wake-up method and device and training method and device of voice wake-up module
CN111312243B (en) Equipment interaction method and device
CN110223694B (en) Voice processing method, system and device
CN113053390A (en) Text processing method and device based on voice recognition, electronic equipment and medium
CN111276127A (en) Voice awakening method and device, storage medium and electronic equipment
CN112328308A (en) Method and device for recognizing text
CN107895573B (en) Method and device for identifying information
CN111899747B (en) Method and apparatus for synthesizing audio
CN112652329B (en) Text realignment method and device, electronic equipment and storage medium
CN114333017A (en) Dynamic pickup method and device, electronic equipment and storage medium
CN111968630B (en) Information processing method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant