CN112489650A - Wake-up control method and device, storage medium and terminal - Google Patents

Info

Publication number
CN112489650A
Authority
CN
China
Prior art keywords
terminal
audio data
wake
historical
awakening
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011349897.0A
Other languages
Chinese (zh)
Inventor
程思
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Pinecone Electronic Co Ltd
Original Assignee
Beijing Xiaomi Pinecone Electronic Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Abstract

The present disclosure relates to a wake-up control method, apparatus, storage medium and terminal, where the method is applied to a first terminal, and includes: collecting audio data in real time; under the condition that first audio data collected in a target time period meet a preset awakening condition, obtaining first historical audio data in a stored historical time period, wherein the historical time period comprises a preset time period before the target time period; acquiring a first voiceprint characteristic corresponding to the first historical audio data, and sending the first voiceprint characteristic to a second terminal, so that the second terminal acquires stored second historical audio data in the historical time period after receiving the first voiceprint characteristic, and sends awakening response information to the first terminal according to the second historical audio data; receiving the awakening response information sent by the second terminal; according to the awakening response information, determining a terminal to be awakened from the first terminal and the second terminal; and controlling the terminal to be awakened.

Description

Wake-up control method and device, storage medium and terminal
Technical Field
The present disclosure relates to the field of terminal technologies, and in particular, to a wake-up control method and apparatus, a storage medium, and a terminal.
Background
With the development of technology, smart voice products have become increasingly common. Typical smart voice devices include smart speakers, children's companion robots, smart alarm clocks, and smart household appliances. In general, the wake-up words preset in different smart devices may be the same. When several smart voice devices are present in the same space and the user speaks that shared wake-up word, the devices may all respond at once, which easily confuses the user and seriously degrades the interaction experience.
In the related art, to prevent several smart voice devices from responding to the user's wake-up simultaneously, a nearby wake-up decision method can select the smart voice device closest to the user to respond. However, when only some of the smart voice devices can be woken by the wake-up word spoken by the user, a woken device can respond only after a timeout decision timer expires, so the wake-up response is slow and the user experience suffers.
Disclosure of Invention
To overcome the problems in the related art, the present disclosure provides a wake-up control method, apparatus, storage medium, and terminal.
According to a first aspect of the embodiments of the present disclosure, there is provided a wake-up control method applied to a first terminal, the method including: collecting audio data in real time; under the condition that first audio data collected in a target time period meet a preset awakening condition, obtaining first historical audio data in a stored historical time period, wherein the historical time period comprises a preset time period before the target time period; acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to a second terminal, so that the second terminal acquires stored second historical audio data in the historical time period after receiving the first voiceprint feature, and sends awakening response information to the first terminal according to the second historical audio data; receiving the awakening response information sent by the second terminal; determining a terminal to be awakened from the first terminal and the second terminal according to the awakening response information; and controlling the terminal to be awakened.
Optionally, the preset wake-up condition includes: the first audio data comprise preset awakening words; or the first audio data comprises target keywords in the preset awakening words, the target keywords comprise the first N keywords in the preset awakening words, N is an integer which is greater than 0 and smaller than a target number, and the target number is the number of the keywords in the preset awakening words.
Optionally, the wake-up response information includes wake-up rejection information or a wake-up feature of the second historical audio data; determining, according to the wake-up response information, a terminal to be awakened from the first terminal and the second terminal includes: in a case where the wake-up response information includes the wake-up feature of the second historical audio data, determining a target terminal closest to a user from the first terminal and the second terminal according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and taking the target terminal as the terminal to be awakened, wherein the wake-up feature is used for representing the distance between the terminal and the user; or, in a case where the wake-up response information includes the wake-up rejection information, taking the first terminal as the terminal to be awakened.
Optionally, the controlling the terminal to be woken up to wake up includes: controlling the first terminal to wake up under the condition that the terminal to be woken up is the first terminal; or sending a wake-up instruction to the second terminal under the condition that the terminal to be woken up is the second terminal, so that the second terminal can be woken up according to the wake-up instruction.
According to a second aspect of the embodiments of the present disclosure, there is provided a wake-up control method applied to a second terminal, the method including: collecting audio data in real time; under the condition that a first voiceprint characteristic corresponding to first historical audio data collected in a historical time period and sent by a first terminal is received, second historical audio data in the historical time period are obtained; and sending awakening response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be awakened according to the awakening response information and controls the terminal to be awakened.
Optionally, the wake-up response message includes a wake-up rejection message or a wake-up feature of the second historical audio data, and before the wake-up response message is sent to the first terminal according to the second historical audio data, the method further includes: determining whether to awaken the second terminal according to the second historical audio data; the sending of the wake-up response information to the first terminal according to the second historical audio data includes: sending a wake-up feature of the second historical audio data to the first terminal on a condition that the second terminal is determined to be woken up; or sending the awakening refusing information to the first terminal under the condition of determining not to awaken the second terminal.
Optionally, before the determining whether to wake up the second terminal according to the second historical audio data, the method further includes: determining whether the second historical audio data comprises voice data; the determining whether to wake up the second terminal according to the second historical audio data includes: under the condition that the second historical audio data comprises the voice data, acquiring a second voiceprint feature corresponding to the second historical audio data, and determining whether to awaken the second terminal or not according to the first voiceprint feature and the second voiceprint feature; or, in a case that the second history audio data does not include the voice data, determining not to wake up the second terminal.
Optionally, the determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature includes: determining whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature; and when the first historical audio data and the second historical audio data belong to the same user, determining to awaken the second terminal, or when the first historical audio data and the second historical audio data do not belong to the same user, determining not to awaken the second terminal.
Optionally, the method further comprises: receiving a wake-up instruction sent by the first terminal under the condition that the terminal to be woken up is the second terminal; and awakening the second terminal according to the awakening instruction.
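The second terminal's decision described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the disclosure does not say how voiceprint features are represented or compared, so the numeric vectors, the cosine-similarity comparison, the 0.8 threshold, and the names `cosine_similarity` and `build_wake_response` are all hypothetical.

```python
import math
from typing import Optional, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_wake_response(
    first_voiceprint: Sequence[float],
    second_voiceprint: Optional[Sequence[float]],
    wake_feature: float,
    threshold: float = 0.8,  # assumed value, not from the disclosure
) -> dict:
    """Build the wake-up response the second terminal sends to the first terminal.

    second_voiceprint is None when the second historical audio data
    contains no voice data.
    """
    # No voice data: determine not to wake the second terminal.
    if second_voiceprint is None:
        return {"type": "reject"}
    # Same user (assumed test: similarity at or above the threshold):
    # reply with the wake-up feature of the second historical audio data.
    if cosine_similarity(first_voiceprint, second_voiceprint) >= threshold:
        return {"type": "wake_feature", "value": wake_feature}
    # Different user: reply with wake-up rejection information.
    return {"type": "reject"}
```

Any speaker-verification score could replace the cosine comparison; the branch structure is the part that follows the text.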
According to a third aspect of the embodiments of the present disclosure, there is provided a wake-up control apparatus, applied to a first terminal, the apparatus including: a first data acquisition module configured to acquire audio data in real time; the first data acquisition module is configured to acquire first historical audio data in a stored historical time period under the condition that first audio data acquired in a target time period meet a preset awakening condition, wherein the historical time period comprises a preset time period before the target time period; the voiceprint feature acquisition module is configured to acquire a first voiceprint feature corresponding to the first historical audio data and send the first voiceprint feature to a second terminal, so that the second terminal acquires stored second historical audio data in the historical time period after receiving the first voiceprint feature and sends awakening response information to the first terminal according to the second historical audio data; an information receiving module configured to receive the wake-up response information sent by the second terminal; a terminal determining module configured to determine a terminal to be awakened from the first terminal and the second terminal according to the awakening response information; and the first awakening module is configured to control the terminal to be awakened.
Optionally, the preset wake-up condition includes: the first audio data comprise preset awakening words; or the first audio data comprises target keywords in the preset awakening words, the target keywords comprise the first N keywords in the preset awakening words, N is an integer which is greater than 0 and smaller than a target number, and the target number is the number of the keywords in the preset awakening words.
Optionally, the wake response information includes a wake rejection information or a wake characteristic of the second historical audio data; the terminal determination module includes: the terminal determination submodule is configured to determine a target terminal closest to a user from the first terminal and the second terminal according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data under the condition that the wake-up response information includes the wake-up feature of the second historical audio data, and use the target terminal as the terminal to be woken up, wherein the wake-up feature is used for representing the distance between the terminal and the user; or, the first terminal is used as the terminal to be awakened when the awakening response message includes the awakening refusing message.
Optionally, the first wake-up module includes: the wake-up sub-module is configured to control the first terminal to wake up when the terminal to be woken up is the first terminal; or sending a wake-up instruction to the second terminal under the condition that the terminal to be woken up is the second terminal, so that the second terminal can be woken up according to the wake-up instruction.
According to a fourth aspect of the embodiments of the present disclosure, there is provided a wake-up control apparatus, applied to a second terminal, the apparatus including: a second data acquisition module configured to acquire audio data in real time; the second data acquisition module is configured to acquire stored second historical audio data in a historical time period when a first voiceprint feature corresponding to first historical audio data acquired in the historical time period and sent by a first terminal is received; and the information sending module is configured to send awakening response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be awakened according to the awakening response information and controls the terminal to be awakened.
Optionally, the wake response message includes a wake rejection message or a wake characteristic of the second historical audio data, and the apparatus further includes: a wake-up determination module configured to determine whether to wake up the second terminal according to the second historical audio data; the information sending module is configured to: sending a wake-up feature of the second historical audio data to the first terminal on a condition that the second terminal is determined to be woken up; or sending the awakening refusing information to the first terminal under the condition of determining not to awaken the second terminal.
Optionally, the apparatus further comprises: a data determination module configured to determine whether the second historical audio data comprises voice data; the wake determination module configured to: under the condition that the second historical audio data comprises the voice data, acquiring a second voiceprint feature corresponding to the second historical audio data, and determining whether to awaken the second terminal or not according to the first voiceprint feature and the second voiceprint feature; or, in a case that the second history audio data does not include the voice data, determining not to wake up the second terminal.
Optionally, the wake-up determining module includes: the user judgment sub-module is configured to determine whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature; a wake-up determination sub-module configured to determine to wake up the second terminal in a case where it is determined that the first and second historical audio data belong to the same user, or determine not to wake up the second terminal in a case where it is determined that the first and second historical audio data do not belong to the same user.
Optionally, the apparatus further comprises: the instruction receiving module is configured to receive a wake-up instruction sent by the first terminal under the condition that the terminal to be woken up is the second terminal; and the second awakening module is configured to awaken the second terminal according to the awakening instruction.
According to a fifth aspect of embodiments of the present disclosure, there is provided a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the steps of the wake-up control method provided by the first aspect of the present disclosure, or which, when executed by a processor, implement the steps of the wake-up control method provided by the second aspect of the present disclosure.
According to a sixth aspect of the embodiments of the present disclosure, there is provided a terminal, including: a memory having a computer program stored thereon; a processor for executing the computer program in the memory to implement the steps of the wake-up control method provided by the first aspect of the present disclosure, or to implement the steps of the wake-up control method provided by the second aspect of the present disclosure.
The technical solutions provided by the embodiments of the present disclosure may have the following beneficial effects. Audio data is collected in real time; in a case where first audio data collected in a target time period meets a preset wake-up condition, stored first historical audio data in a historical time period is obtained, wherein the historical time period comprises a preset time period before the target time period; a first voiceprint feature corresponding to the first historical audio data is acquired and sent to a second terminal, so that the second terminal, after receiving the first voiceprint feature, acquires stored second historical audio data in the historical time period and sends wake-up response information to the first terminal according to the second historical audio data; the wake-up response information sent by the second terminal is received; a terminal to be awakened is determined from the first terminal and the second terminal according to the wake-up response information; and the terminal to be awakened is controlled to wake up. That is, the first terminal can send the first voiceprint feature to the second terminal once the preset wake-up condition is met, and the second terminal can send the wake-up response information to the first terminal according to the second historical audio data after receiving the first voiceprint feature. The first terminal can therefore determine the terminal to be awakened as soon as it receives the wake-up response information, without waiting for a timeout decision timer to expire, and control that terminal to wake up, which increases the wake-up response speed and improves the user experience.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
FIG. 1 is a flow chart illustrating a wake-up control method according to an exemplary embodiment;
FIG. 2 is a flow chart illustrating a second wake-up control method according to an exemplary embodiment;
FIG. 3 is a flow diagram illustrating a third wake-up control method in accordance with an exemplary embodiment;
FIG. 4 is a flow diagram illustrating a fourth wake-up control method in accordance with an exemplary embodiment;
FIG. 5 is a schematic diagram illustrating a wake-up control apparatus according to an exemplary embodiment;
FIG. 6 is a schematic structural diagram illustrating a second wake-up control apparatus according to an exemplary embodiment;
FIG. 7 is a schematic diagram illustrating a third wake-up control apparatus according to an exemplary embodiment;
FIG. 8 is a schematic diagram illustrating a fourth wake-up control apparatus according to an exemplary embodiment;
FIG. 9 is a schematic structural diagram illustrating a fifth wake-up control apparatus according to an exemplary embodiment;
FIG. 10 is a block diagram illustrating a terminal according to an exemplary embodiment.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
In the description that follows, the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or order.
First, an application scenario of the present disclosure will be explained. In general, the wake-up word is preset in the smart voice device by the user, or is set before the smart voice device leaves the factory, so the wake-up words preset in different smart voice devices may be the same. With the popularization of smart voice products, several smart voice devices may be present in the same space at the same time. After a user speaks the shared wake-up word, these devices may all respond and try to interact with the user at once, which easily confuses the user and degrades the interaction experience.
In the related art, a multi-device linkage system can be established: several smart voice devices in the same space form a linkage system through the same account. After a user speaks a wake-up word, each smart voice device in the linkage system determines from the wake-up word whether it is woken; once a device determines that it is woken, it starts a timeout decision timer and sends its wake-up result to the other smart voice devices. If wake-up results sent by other smart voice devices are received before the timeout decision timer expires, the smart voice device closest to the user can be selected from the woken devices to respond to the user's wake-up. If no wake-up result from another smart voice device is received before the timer expires, that is, only one smart voice device in the linkage system is woken, the woken device must still wait until the timer expires before responding to the user's wake-up. The wake-up response of the whole linkage system is therefore slow, which affects the user experience.
To solve the above problems, the present disclosure provides a wake-up control method, apparatus, storage medium, and terminal. A first terminal may send a first voiceprint feature to a second terminal after a preset wake-up condition is met, and the second terminal, after receiving the first voiceprint feature, may send wake-up response information to the first terminal according to second historical audio data that the second terminal collected in a historical time period. The first terminal therefore does not need to wait for a timeout decision timer to expire: after receiving the wake-up response information sent by the second terminal, it can determine the terminal to be awakened and control that terminal to wake up, which increases the wake-up response speed and improves the user experience.
The first terminal and the second terminal of the present disclosure may belong to the same multi-device linkage system, and their types may be the same or different. The first terminal and the second terminal include, but are not limited to, fixed devices and mobile devices. Fixed devices include, but are not limited to, personal computers (PCs), televisions, air conditioners, wall-mounted ovens, and the like; mobile devices include, but are not limited to, mobile phones, tablet computers, wearable devices, speakers, alarm clocks, and the like, which is not limited in this disclosure. The multi-device linkage system may use a nearby wake-up function to determine, from among several terminals, the terminal that is to respond to a user's wake-up. For example, both the first terminal and the second terminal may be devices in a distributed far-field device network. The second terminal may comprise a plurality of terminals, and the first terminal and the second terminal are in the same space and form the multi-device linkage system through the same account. In addition, in the present disclosure the wake-up response speed of the first terminal is faster than that of the second terminal; for example, the wake-up engine of the first terminal performs better than that of the second terminal. That is, the first terminal can wake up more quickly after the user speaks the wake-up word.
The present disclosure is described below with reference to specific examples.
Fig. 1 is a flowchart illustrating a wake-up control method according to an exemplary embodiment, which is applied to a first terminal, and as shown in fig. 1, the method includes:
S101, collecting audio data in real time.
In this step, after the first terminal is powered on and started, the data acquisition module of the first terminal may acquire audio data in real time.
S102, under the condition that the first audio data collected in the target time period meet a preset awakening condition, the first historical audio data in the stored historical time period are obtained.
The historical time period may include a preset time period before the target time period. The preset time period may be the time the user needs to finish speaking the wake-up word, so the length of the historical time period differs for different wake-up words.
It should be noted that, in the process of acquiring audio data in real time, the data acquisition module of the first terminal may store first historical audio data acquired in a historical time period. For example, the present disclosure may store the first historical audio data in a queue, where the audio data at the head of the queue is the audio data obtained earliest in the first historical audio data, and the audio data at the tail of the queue is the audio data obtained latest in the first historical audio data, and after new audio data is acquired each time, the audio data at the head of the queue may be deleted, and the new audio data may be stored at the tail of the queue.
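The queue described above can be sketched as a fixed-length buffer. This is only an illustration: the class name `HistoryBuffer`, the frame-based granularity, and the capacity of 3 frames are hypothetical, and the disclosure does not specify a data structure beyond a queue.

```python
from collections import deque
from typing import List


class HistoryBuffer:
    """Fixed-length queue holding the most recent audio frames (hypothetical helper)."""

    def __init__(self, max_frames: int) -> None:
        # A deque with maxlen discards the oldest item at the head
        # whenever a new item is appended to a full queue.
        self._frames = deque(maxlen=max_frames)

    def push(self, frame: bytes) -> None:
        """Store a newly collected frame at the tail of the queue."""
        self._frames.append(frame)

    def snapshot(self) -> List[bytes]:
        """Return the stored frames, oldest first."""
        return list(self._frames)


buf = HistoryBuffer(max_frames=3)
for frame in (b"f1", b"f2", b"f3", b"f4"):
    buf.push(frame)
history = buf.snapshot()  # b"f1" was dropped when b"f4" arrived
```

In practice the capacity would be chosen to cover the historical time period, for example the time needed to speak the wake-up word.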
In this step, after the first terminal collects the first audio data in the target time period, it may obtain the preset wake-up word and determine, according to the preset wake-up word, whether the first audio data meets the preset wake-up condition. The preset wake-up condition may include: the first audio data includes the preset wake-up word; or the first audio data includes target keywords in the preset wake-up word, where the target keywords include the first N keywords in the preset wake-up word, N is an integer greater than 0 and smaller than a target number, and the target number is the number of keywords in the preset wake-up word. The value of N may be determined according to the type of the first terminal. For example, when the preset wake-up word includes 5 keywords, a smaller value, for example N = 2, may be set for a first terminal with a higher requirement on wake-up real-time performance, and a larger value, for example N = 4, may be set for a first terminal with a lower requirement on wake-up real-time performance, which is not limited in this disclosure. In addition, the smaller the value of N, the higher the probability that the first terminal is woken by mistake, so the value of N may also be determined according to the first terminal's requirement on the false wake-up rate. For example, when the preset wake-up word includes 5 keywords, a larger value, for example 4, may be set for a first terminal with a stricter requirement on the false wake-up rate, and a smaller value, for example 2, may be set for a first terminal with a looser requirement on the false wake-up rate, which is not limited by this disclosure.
In a case that the preset wake-up condition is that the first audio data includes the preset wake-up word, the wake-up engine of the first terminal determines that the first audio data satisfies the preset wake-up condition only after detecting each keyword in the preset wake-up word, for example, if the preset wake-up word is "ABCDE", the first audio data satisfies the preset wake-up condition when the first audio data includes "ABCDE". In a case that the preset wake-up condition is that the first audio data includes a target keyword in the preset wake-up word, the wake-up engine of the first terminal may determine that the first audio data satisfies the preset wake-up condition after detecting the target keyword, for example, if the preset wake-up word is "ABCDE" and the target keyword is "ABC", it indicates that the first audio data satisfies the preset wake-up condition when the first audio data includes "ABC".
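The two forms of the preset wake-up condition can be illustrated with a small check over detected keywords. This sketch assumes the wake-up engine already outputs a sequence of recognized keywords; the function name `meets_wake_condition` and the list-of-strings representation are hypothetical.

```python
from typing import List, Optional


def meets_wake_condition(
    detected: List[str], wake_word: List[str], n: Optional[int] = None
) -> bool:
    """Check the preset wake-up condition against detected keywords.

    n=None requires the whole wake-up word. Otherwise the first n keywords
    suffice, where 0 < n < len(wake_word).
    """
    if n is None:
        required = wake_word
    else:
        if not 0 < n < len(wake_word):
            raise ValueError("n must satisfy 0 < n < number of keywords")
        required = wake_word[:n]
    m = len(required)
    # Look for the required keywords as a contiguous run in the detected stream.
    return any(detected[i:i + m] == required for i in range(len(detected) - m + 1))


wake_word = list("ABCDE")
early = meets_wake_condition(list("ABC"), wake_word, n=3)  # first 3 keywords suffice
full = meets_wake_condition(list("ABC"), wake_word)        # whole word required
```

A smaller `n` lets the condition be met earlier in the utterance, at the cost of a higher chance of a false wake-up, which matches the trade-off described for choosing N.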
After the first terminal determines that the first audio data meets the preset awakening condition, the first historical audio data needs to be acquired, the first voiceprint feature is extracted from the first historical audio data, and a certain time is required for extracting the first voiceprint feature.
Further, in the case that it is determined that the first audio data meets the preset wake-up condition, the stored first historical audio data in the historical time period may be acquired.
S103, acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to the second terminal.
In this step, after the first historical audio data is obtained, a first voiceprint feature corresponding to the first historical audio data may be extracted by a related art method, and the first voiceprint feature may be sent to the second terminal. After receiving the first voiceprint feature sent by the first terminal, the second terminal may first obtain second historical audio data stored in the historical time period, and send wakeup response information to the first terminal according to the second historical audio data.
S104, receiving the wake-up response information sent by the second terminal.
S105, determining a terminal to be woken up from the first terminal and the second terminal according to the wake-up response information.
The wake-up response information may include wake-up rejection information or a wake-up feature of the second historical audio data. In the case where the wake-up response information includes the wake-up feature of the second historical audio data, it indicates that the second terminal may also be woken up according to the second audio data collected in the target time period; in the case where the wake-up response information includes the wake-up rejection information, it indicates that the second terminal is not woken up by the second audio data collected in the target time period.
In this step, after receiving the wake-up response message sent by the second terminal, the first terminal may determine, according to the wake-up response message, a terminal that can be woken up by the audio data collected in the target time period. If only the first terminal is awakened, the first terminal can be used as the terminal to be awakened, and if a plurality of terminals are awakened, a target terminal closest to a user can be selected from the plurality of awakened terminals and used as the terminal to be awakened.
In a possible implementation manner, in a case that the wake-up response information includes a wake-up feature of the second historical audio data, a wake-up feature of the first audio data may be obtained, and according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, a target terminal closest to a user is determined from the first terminal and the second terminal, and the target terminal is taken as the terminal to be woken up, where the wake-up feature is used to represent a distance between a terminal and the user. Here, when the wake-up response information includes the wake-up feature of the second historical audio data, it indicates that the second terminal may be woken up, in which case the first terminal may determine a target terminal closest to a user according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and in the case where the wake-up feature is the audio energy of the audio data, the first terminal may compare the audio energy of the first audio data and the audio energy of the second historical audio data, determine a target terminal having the largest audio energy from the first terminal and the second terminal, and regard the target terminal as the terminal to be woken up.
In the case that the wake-up response message includes the refusal wake-up message, it indicates that the second terminal is not to be woken up, in which case, the first terminal may be regarded as the terminal to be woken up.
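The decision made by the first terminal in step S105 can be sketched as follows. This is a minimal illustration that assumes the wake-up feature is the audio energy, computed here as the mean squared sample value. The names `WakeResponse`, `audio_energy` and `choose_terminal` are hypothetical, and resolving a tie in favour of the first terminal is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class WakeResponse:
    """Wake-up response information sent by the second terminal (hypothetical shape)."""
    rejected: bool                         # True: wake-up rejection information
    wake_feature: Optional[float] = None   # e.g. audio energy of the second historical audio data


def audio_energy(samples: Sequence[float]) -> float:
    """Mean squared amplitude, used here as a stand-in for the wake-up feature."""
    if not samples:
        return 0.0
    return sum(s * s for s in samples) / len(samples)


def choose_terminal(first_feature: float, response: WakeResponse) -> str:
    """Return "first" or "second" as the terminal to be woken up."""
    if response.rejected or response.wake_feature is None:
        # The second terminal is not to be woken up.
        return "first"
    # Larger audio energy is taken to mean the terminal is closer to the user.
    # Assumption: on a tie the first terminal is chosen.
    return "second" if response.wake_feature > first_feature else "first"
```

For example, `choose_terminal(0.5, WakeResponse(rejected=False, wake_feature=0.9))` selects the second terminal, while any rejection selects the first terminal.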
It should be noted that the wake-up feature of the first audio data may be obtained when the first terminal determines that the first audio data meets the preset wake-up condition, and in addition, in consideration that the first historical audio data also includes a wake-up word spoken by the user, in the case that the wake-up response information includes the wake-up feature of the second historical audio data, the wake-up feature of the first historical audio data may also be obtained, and according to the wake-up feature of the first historical audio data and the wake-up feature of the second historical audio data, a target terminal closest to the user is determined from the first terminal and the second terminal, and the target terminal is taken as the terminal to be woken up.
S106, controlling the terminal to be woken up to wake up.
In this step, after determining the terminal to be woken up, the first terminal may control the terminal to be woken up to respond to the user's wake-up according to a preset response instruction. In a possible implementation manner, in a case that the terminal to be woken up is the first terminal, the first terminal may be controlled to wake up; or, in a case that the terminal to be woken up is the second terminal, a wake-up instruction may be sent to the second terminal, so that the second terminal wakes up according to the wake-up instruction. For example, when the terminal to be woken up is the first terminal, the first terminal may directly output a response instruction, for example, "I am". When the terminal to be woken up is the second terminal, which indicates that the second terminal is closer to the user, the first terminal may send a wake-up instruction to the second terminal, and the second terminal may output the response instruction, for example, "I am", after receiving the wake-up instruction.
It should be noted that the above-mentioned manner for controlling the terminal to be awakened to wake up is only an example, and the terminal to be awakened to wake up may also be awakened by other manners of the related art, which is not limited in this disclosure.
By adopting the method, the first terminal can send the first voiceprint feature to the second terminal after meeting the preset wake-up condition. After receiving the first voiceprint feature, the second terminal can acquire the stored second historical audio data in the historical time period and send the wake-up response information to the first terminal according to the second historical audio data. In this way, the first terminal does not need to wait for the timeout judgment timer to expire: it can determine the terminal to be woken up after receiving the wake-up response information sent by the second terminal and control the terminal to be woken up to wake up, thereby increasing the wake-up response speed and improving the user experience.
Fig. 2 is a flowchart illustrating a second wake-up control method according to an exemplary embodiment, which is applied to a second terminal, and as shown in fig. 2, the method includes:
S201, collecting audio data in real time.
S202, under the condition that a first voiceprint feature corresponding to first historical audio data collected in a historical time period and sent by a first terminal is received, second historical audio data in the historical time period are obtained.
The historical time period may include a preset time period before the target time period, where the preset time period may be a time period required for the user to finish speaking the wakeup word, and the time periods of the historical time periods corresponding to different wakeup words are also different.
It should be noted that, in the process of acquiring audio data in real time, the data acquisition module of the second terminal may store second historical audio data acquired within a historical time period, where the manner of storing the second historical audio data may refer to the manner of storing the first historical audio data in step S102, and is not described herein again.
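One way to keep only the most recent audio while collecting in real time is a bounded buffer. The sketch below is an assumption for illustration, not the storage manner described in step S102: the class name `HistoryBuffer` and the frame-based capacity are hypothetical, and the capacity would be chosen to cover the preset time period before the target time period.

```python
from collections import deque
from typing import Deque, List, Sequence


class HistoryBuffer:
    """Keeps the most recent audio frames; older frames are discarded automatically."""

    def __init__(self, max_frames: int) -> None:
        if max_frames <= 0:
            raise ValueError("max_frames must be positive")
        # deque with maxlen drops the oldest item when a new one is appended at capacity.
        self._frames: Deque[Sequence[float]] = deque(maxlen=max_frames)

    def push(self, frame: Sequence[float]) -> None:
        """Store one newly collected frame."""
        self._frames.append(frame)

    def snapshot(self) -> List[Sequence[float]]:
        """Return the stored historical frames, oldest first."""
        return list(self._frames)
```

When the first voiceprint feature arrives, the second terminal would call `snapshot()` to obtain the second historical audio data without having to wait for new audio.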
In this step, after the second terminal receives the first voiceprint feature corresponding to the first historical audio data sent by the first terminal, it indicates that the first terminal has been awakened and the second terminal has not been awakened.
S203, sending wake-up response information to the first terminal according to the second historical audio data.
The wake-up response information may include wake-up rejection information or a wake-up feature of the second historical audio data.
In this step, after acquiring the second historical audio data, the second terminal may determine whether the second terminal can be woken up according to the second historical audio data. The wake-up feature of the second historical audio data may be obtained and sent to the first terminal if it is determined that the second terminal may be woken up, and the wake-up rejection message may be sent to the first terminal if it is determined that the second terminal may not be woken up.
In order to determine whether the second terminal can be woken up, in a possible implementation manner, it may be determined whether the second historical audio data includes voice data. In the case where the second historical audio data does not include voice data, it indicates that the second terminal is not to be woken up, and the wake-up rejection message may be sent to the first terminal.
In the case where the second historical audio data includes voice data, it indicates that the second terminal may be woken up. However, the voice included in the second historical audio data may not come from the same person as the voice included in the first historical audio data, so, in order to avoid the second terminal being woken up by mistake, it may be further determined whether the second terminal may be woken up. In a possible implementation manner, when the second historical audio data includes voice data, a second voiceprint feature corresponding to the second historical audio data may be obtained, and whether to wake up the second terminal is determined according to the first voiceprint feature and the second voiceprint feature.
Since the voiceprint features of different speakers are different, here, it may be determined whether the first and second history audio data belong to the same user according to the first and second voiceprint features, and in a case where it is determined that the first and second history audio data belong to the same user, it may be determined to wake up the second terminal, or, in a case where it is determined that the first and second history audio data do not belong to the same user, it may be determined not to wake up the second terminal. Determining whether the first voiceprint feature and the second voiceprint feature belong to the same speaker, and under the condition that the first voiceprint feature and the second voiceprint feature belong to the same speaker, determining that the first historical audio data and the second historical audio data belong to the same user, so as to determine to awaken the second terminal, obtain an awakening feature of the second historical audio data, and send the awakening feature of the second historical audio data to the first terminal; in a case where it is determined that the first voiceprint feature and the second voiceprint feature do not belong to the same speaker, it may be determined that the first historical audio data and the second historical audio data do not belong to the same user, thereby determining not to wake up the second terminal, and transmitting the wake-up rejection message to the first terminal.
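The decision logic of the second terminal described above can be sketched in Python. The disclosure does not state how two voiceprint features are compared; the cosine similarity and the threshold of 0.8 used here are assumptions for illustration, as are the function names and the tuple format of the response.

```python
import math
from typing import Optional, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_wake_response(
    first_voiceprint: Sequence[float],
    second_voiceprint: Sequence[float],
    contains_voice: bool,
    second_wake_feature: float,
    threshold: float = 0.8,  # assumed value, not taken from the disclosure
) -> Tuple[str, Optional[float]]:
    """Return ("reject", None) or ("feature", wake-up feature of the second historical audio)."""
    if not contains_voice:
        # No voice data in the second historical audio data: do not wake up.
        return ("reject", None)
    if cosine_similarity(first_voiceprint, second_voiceprint) < threshold:
        # Voiceprints judged to belong to different speakers: do not wake up.
        return ("reject", None)
    # Same speaker: report the wake-up feature so the first terminal can compare.
    return ("feature", second_wake_feature)
```

A real system would typically use speaker embeddings from a trained model as the voiceprint features; the plain float vectors here only stand in for them.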
It should be noted that, the manner in which the second terminal obtains the wake-up feature of the second historical audio data may refer to the manner in which the first terminal obtains the wake-up feature of the first historical audio data, and is not described herein again.
After receiving the wake-up response message sent by the second terminal, the first terminal may determine the terminal to be woken up according to the wake-up response message. In a case where the wake-up response information includes the refusal wake-up information, the first terminal may be regarded as the terminal to be woken up, and in a case where the wake-up response information includes the wake-up feature of the second historical audio data, the terminal to be woken up may be determined from the first terminal and the second terminal according to the wake-up feature of the first historical audio data and the wake-up feature of the second historical audio data.
After determining the terminal to be awakened, the first terminal may control the terminal to be awakened to respond to the awakening of the user according to a preset response instruction. In a case that the terminal to be woken up is the first terminal, the first terminal may directly respond to the user's waking up, for example, the first terminal may directly output a response instruction, for example, "i am", and in a case that the terminal to be woken up is the second terminal, the second terminal may receive the wake up instruction sent by the first terminal, wake up the second terminal according to the wake up instruction, respond to the user's waking up, for example, the second terminal may output the response instruction after receiving the wake up instruction.
By adopting the method, after receiving the first voiceprint feature corresponding to the first historical audio data sent by the first terminal, the second terminal can acquire the stored second historical audio data in the historical time period and send the wake-up response information to the first terminal according to the second historical audio data. In this way, the first terminal does not need to wait for the timeout judgment timer to expire: it can directly determine the terminal to be woken up according to the wake-up response information and control the terminal to be woken up to wake up, thereby increasing the wake-up response speed and improving the user experience.
Fig. 3 is a flowchart illustrating a third wake-up control method according to an exemplary embodiment, as shown in fig. 3, the method includes:
S301, the first terminal and the second terminal collect audio data in real time.
S302, the first terminal acquires first historical audio data in a stored historical time period under the condition that the first audio data collected in the target time period meet a preset awakening condition.
The historical time period may include a preset time period before the target time period, where the preset time period may be the time period required for the user to finish speaking the wake-up word, and the durations of the historical time periods corresponding to different wake-up words also differ.
S303, the first terminal acquires a first voiceprint feature corresponding to the first historical audio data and sends the first voiceprint feature to the second terminal.
S304, the second terminal acquires the stored second historical audio data in the historical time period under the condition that the first voiceprint feature corresponding to the first historical audio data collected in the historical time period sent by the first terminal is received.
S305, the second terminal determines whether the second history audio data includes voice data, and if the second history audio data includes voice data, executes steps S306 to S309, and if the second history audio data does not include voice data, determines not to wake up the second terminal, and executes step S309.
S306, the second terminal acquires a second voiceprint feature corresponding to the second historical audio data.
S307, the second terminal determines whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature, executes step S308 in the case that it is determined to wake up the second terminal, and executes step S309 in the case that it is determined not to wake up the second terminal.
S308, the second terminal sends the awakening characteristic of the second historical audio data to the first terminal.
S309, the second terminal sends the awakening refusing information to the first terminal.
S310, after receiving the wake-up response information sent by the second terminal, the first terminal determines the terminal to be woken up according to the wake-up response information.
In this step, in a case that the wake-up response information includes the wake-up feature of the second historical audio data, the first terminal may obtain the wake-up feature of the first audio data, determine, according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, a target terminal closest to a user from the first terminal and the second terminal, and use the target terminal as the terminal to be woken up; in the case that the wake-up response message includes the refusal wake-up message, it indicates that the second terminal is not to be woken up, in which case, the first terminal may be regarded as the terminal to be woken up.
S311, the first terminal determines whether the terminal to be woken up is the first terminal, if the terminal to be woken up is the first terminal, step S312 is executed, and if the terminal to be woken up is the second terminal, step S313 is executed.
S312, the first terminal controls the first terminal to wake up.
S313, the first terminal sends a wake-up command to the second terminal.
S314, after receiving the wake-up instruction sent by the first terminal, the second terminal wakes up the second terminal according to the wake-up instruction.
The specific implementation manners of the steps S301 to S314 may refer to the specific implementation manners of the embodiment shown in fig. 1 and the embodiment shown in fig. 2, and are not described herein again.
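The branching of steps S302 to S314 can be traced with a small simulation. This is a sketch under stated assumptions: the wake-up feature is treated as a single number where a larger value means the terminal is closer to the user, the speaker comparison is supplied by the caller as `same_speaker`, and the function name `fig3_flow` is hypothetical. Audio collection (S301) and network transport are omitted.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Voiceprint = Sequence[float]


def fig3_flow(
    first_voiceprint: Voiceprint,
    first_wake_feature: float,
    second_has_voice: bool,
    second_voiceprint: Voiceprint,
    second_wake_feature: float,
    same_speaker: Callable[[Voiceprint, Voiceprint], bool],
) -> Tuple[str, List[str]]:
    """Return the terminal that wakes up and the list of steps executed."""
    steps = ["S302", "S303", "S304", "S305"]

    # Second terminal: decide whether it may be woken up.
    wake_second = False
    if second_has_voice:
        steps += ["S306", "S307"]
        wake_second = same_speaker(first_voiceprint, second_voiceprint)

    response: Optional[float]
    if wake_second:
        steps.append("S308")   # send wake-up feature of second historical audio
        response = second_wake_feature
    else:
        steps.append("S309")   # send wake-up rejection information
        response = None

    # First terminal: determine the terminal to be woken up.
    steps.append("S310")
    if response is not None and response > first_wake_feature:
        target = "second"
    else:
        target = "first"       # assumption: ties go to the first terminal

    steps.append("S311")
    if target == "first":
        steps.append("S312")
    else:
        steps += ["S313", "S314"]
    return target, steps
```

Calling it with `second_has_voice=False` shows the shortest path, in which the second terminal sends the rejection directly and the first terminal wakes itself.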
By adopting the method, the first terminal can send the first voiceprint feature to the second terminal after meeting the preset awakening condition, the second terminal can acquire the second historical audio data in the stored historical time period after receiving the first voiceprint feature, and determine whether to awaken the second terminal according to the second historical audio data, and can send the awakening refusing information to the first terminal under the condition of determining not to awaken the second terminal; further, the second terminal may obtain a second voiceprint feature corresponding to the second historical audio data when it is determined that the second terminal may be awakened according to the second historical audio data, and determine whether to awaken the second terminal according to the first voiceprint feature and the second voiceprint feature, and may send the awakening feature of the second historical audio data to the first terminal when it is determined that the second terminal is awakened, and may send the awakening rejection message to the first terminal when it is determined that the second terminal is not awakened.
It should be noted that, in the wake-up control method of the embodiment shown in fig. 3, the first terminal is first woken up, and the second terminal is not yet woken up when receiving the first voiceprint feature sent by the first terminal, but considering that the second terminal may already be in a pre-wake-up state when receiving the first voiceprint feature sent by the first terminal, that is, the second terminal is woken up by the wake-up word spoken by the user, but has not yet responded to the user, in this case, the terminal to be woken up in response to user wake-up may also be determined from the first terminal and the second terminal through the second terminal. In one possible implementation, fig. 4 is a flowchart illustrating a fourth wake-up control method according to an exemplary embodiment, and as shown in fig. 4, the method includes:
S401, the first terminal and the second terminal collect audio data in real time.
S402, the first terminal acquires first historical audio data in a stored historical time period under the condition that the first audio data acquired in the target time period meet a preset awakening condition.
The historical time period may include a preset time period before the target time period, where the preset time period may be the time period required for the user to finish speaking the wake-up word, and the durations of the historical time periods corresponding to different wake-up words also differ.
S403, the first terminal obtains a first voiceprint feature corresponding to the first historical audio data and a wakeup feature of the first audio data, and sends the first voiceprint feature and the wakeup feature of the first audio data to the second terminal.
S404, the second terminal determines whether the second terminal is in a pre-awakening state or not under the condition that the first voiceprint feature and the awakening feature of the first audio data sent by the first terminal are received, and acquires second historical audio data in the stored historical time period under the condition that the second terminal is in the pre-awakening state.
It should be noted that, when the second terminal is not in the pre-wake-up state, the subsequent processing manner may refer to the processing manners of S304 to S314 in the embodiment shown in fig. 3, which is not described herein again.
S405, the second terminal acquires a second voiceprint feature corresponding to the second historical audio data.
S406, the second terminal determines whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature, executes steps S407 to S412 in the case that it is determined to wake up the second terminal, and executes steps S413 to S414 in the case that it is determined not to wake up the second terminal.
S407, the second terminal acquires the awakening characteristic of the second historical audio data.
S408, the second terminal determines the terminal to be woken up from the first terminal and the second terminal according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data.
S409, the second terminal determines whether the terminal to be woken up is the second terminal; if the terminal to be woken up is the second terminal, step S410 is executed, and if the terminal to be woken up is the first terminal, step S411 is executed.
S410, the second terminal controls the second terminal to wake up.
S411, the second terminal sends a wake-up instruction to the first terminal.
S412, after receiving the wake-up command sent by the second terminal, the first terminal wakes up the first terminal according to the wake-up command.
S413, the second terminal sends the wakeup rejection message to the first terminal.
S414, after receiving the wake-up response message sent by the second terminal, the first terminal controls the first terminal to wake up.
The specific implementation manners of the steps S401 to S414 can refer to the specific implementation manners of the embodiment shown in fig. 1 and the embodiment shown in fig. 2, and are not described herein again.
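In the embodiment of Fig. 4 the arbitration moves to the second terminal when it is already in the pre-wake-up state. The sketch below shows only that decision. The function name `second_terminal_action`, the returned action labels, and the caller-supplied `same_speaker` comparison are assumptions, and the wake-up feature is again treated as a number where a larger value means closer to the user.

```python
from typing import Callable, Sequence

Voiceprint = Sequence[float]


def second_terminal_action(
    pre_wake: bool,
    first_voiceprint: Voiceprint,
    first_wake_feature: float,
    second_voiceprint: Voiceprint,
    second_wake_feature: float,
    same_speaker: Callable[[Voiceprint, Voiceprint], bool],
) -> str:
    """Return the action taken by the second terminal after S404."""
    if not pre_wake:
        # Not pre-woken: handled as in the Fig. 3 embodiment (S304 to S314).
        return "handle_as_fig3"

    # S405-S406: compare voiceprints.
    if not same_speaker(first_voiceprint, second_voiceprint):
        # S413: send rejection; the first terminal then wakes itself (S414).
        return "send_rejection"

    # S407-S409: compare wake-up features to pick the terminal to be woken up.
    if second_wake_feature > first_wake_feature:
        return "wake_self"              # S410
    # Assumption: ties go to the first terminal.
    return "send_wake_instruction"      # S411, followed by S412 on the first terminal
```

Compared with the Fig. 3 embodiment, the first terminal here sends its own wake-up feature up front, so the second terminal can finish the comparison without a further round trip.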
By adopting the method, after the first terminal meets the preset wake-up condition, it can send the first voiceprint feature and the wake-up feature of the first audio data to the second terminal. After receiving them, the second terminal, in the case that it is in the pre-wake-up state, can obtain the second voiceprint feature corresponding to the second historical audio data and determine whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature. In the case that it is determined to wake up the second terminal, the second terminal can obtain the wake-up feature of the second historical audio data, determine the terminal to be woken up that responds to the user's wake-up according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and control the terminal to be woken up to wake up. In this way, no timeout judgment timer is needed: after receiving the first voiceprint feature and the wake-up feature of the first audio data sent by the first terminal, the second terminal can determine the terminal to be woken up and control it to wake up, thereby increasing the wake-up response speed and improving the user experience.
Fig. 5 is a schematic structural diagram illustrating a wake-up control apparatus according to an exemplary embodiment, the wake-up control apparatus being applied to a first terminal, as shown in fig. 5, and the wake-up control apparatus including:
a first data acquisition module 501 configured to acquire audio data in real time;
a first data obtaining module 502 configured to obtain first historical audio data in a stored historical time period when first audio data collected in a target time period meets a preset wake-up condition, where the historical time period includes a preset time period before the target time period;
a voiceprint feature obtaining module 503, configured to obtain a first voiceprint feature corresponding to the first historical audio data, and send the first voiceprint feature to a second terminal, so that the second terminal obtains stored second historical audio data in the historical time period after receiving the first voiceprint feature, and sends wakeup response information to the first terminal according to the second historical audio data;
an information receiving module 504 configured to receive the wake-up response information sent by the second terminal;
a terminal determining module 505 configured to determine a terminal to be woken from the first terminal and the second terminal according to the wake-up response information;
a first wake-up module 506 configured to control the terminal to be woken up to wake up.
Optionally, the preset wake-up condition includes:
the first audio data comprises a preset wake-up word; or,
the first audio data comprises target keywords in the preset awakening words, the target keywords comprise the first N keywords in the preset awakening words, N is an integer which is larger than 0 and smaller than a target number, and the target number is the number of the keywords in the preset awakening words.
Optionally, the terminal determining module 505 includes:
a terminal determining submodule configured to: in the case that the wake-up response information includes the wake-up feature of the second historical audio data, determine, from the first terminal and the second terminal, a target terminal closest to a user according to the wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and use the target terminal as the terminal to be woken up, where the wake-up feature is used to characterize a distance between a terminal and the user; or,
in the case that the wake-up response information includes wake-up rejection information, use the first terminal as the terminal to be woken up.
Optionally, the first wake-up module 506 includes:
a wake-up submodule configured to control the first terminal to wake up in the case that the terminal to be woken up is the first terminal; or,
to send a wake-up instruction to the second terminal in the case that the terminal to be woken up is the second terminal, so that the second terminal wakes up according to the wake-up instruction.
With this apparatus, the first voiceprint feature can be sent to the second terminal after the first terminal meets the preset wake-up condition. After receiving the first voiceprint feature, the second terminal can acquire the stored second historical audio data in the historical time period and send wake-up response information to the first terminal according to the second historical audio data. In this way, the first terminal does not need to wait for the timeout judgment timer to expire: it can determine the terminal to be woken up after receiving the wake-up response information sent by the second terminal and control the terminal to be woken up to wake up, thereby increasing the wake-up response speed and improving the user experience.
Fig. 6 is a schematic structural diagram illustrating a second wake-up control apparatus according to an exemplary embodiment, which is applied to a second terminal, and as shown in fig. 6, the apparatus includes:
a second data acquisition module 601 configured to acquire audio data in real time;
a second data obtaining module 602 configured to, in the case of receiving a first voiceprint feature that is sent by a first terminal and corresponds to first historical audio data collected within a historical time period, obtain stored second historical audio data within the historical time period;
an information sending module 603 configured to send wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be woken up according to the wake-up response information and controls the terminal to be woken up to wake up.
Optionally, the wake-up response information includes wake-up rejection information or a wake-up feature of the second historical audio data. Fig. 7 is a schematic structural diagram of a third wake-up control apparatus according to an exemplary embodiment, and as shown in fig. 7, the apparatus further includes:
a wake-up determining module 604 configured to determine whether to wake up the second terminal according to the second historical audio data;
the information sending module 603 is configured to:
send a wake-up feature of the second historical audio data to the first terminal in the case that it is determined to wake up the second terminal; or,
send the wake-up rejection information to the first terminal in the case that it is determined not to wake up the second terminal.
Optionally, Fig. 8 is a schematic structural diagram of a fourth wake-up control apparatus according to an exemplary embodiment, and as shown in fig. 8, the apparatus further includes:
a data determination module 605 configured to determine whether the second historical audio data comprises voice data;
the wake determination module 604, configured to:
in the case that the second historical audio data includes the voice data, acquire a second voiceprint feature corresponding to the second historical audio data, and determine whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or,
in the case that the second historical audio data does not include the voice data, determine not to wake up the second terminal.
Optionally, the wake-up determining module 604 includes:
a user judgment sub-module configured to determine whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature;
a wake-up determination sub-module configured to determine to wake up the second terminal in a case where it is determined that the first and second historical audio data belong to the same user, or determine not to wake up the second terminal in a case where it is determined that the first and second historical audio data do not belong to the same user.
Optionally, Fig. 9 is a schematic structural diagram of a fifth wake-up control apparatus according to an exemplary embodiment, and as shown in fig. 9, the apparatus further includes:
an instruction receiving module 606 configured to receive a wake-up instruction sent by the first terminal when the terminal to be woken up is the second terminal;
a second wake-up module 607 configured to wake up the second terminal according to the wake-up instruction.
With the above apparatus, after receiving the first voiceprint feature corresponding to the first historical audio data sent by the first terminal, the second terminal can acquire the stored second historical audio data in the historical time period and send the wake-up response information to the first terminal according to the second historical audio data. The first terminal therefore does not need to wait for the timeout determination timer to expire; it can determine the terminal to be awakened directly from the wake-up response information and control that terminal to wake up, which improves the wake-up response speed and the user experience.
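On the first terminal's side, the decision can be made as soon as the response arrives, with no timer involved. The following sketch assumes a scalar wake-up feature for which a larger value means the user is closer (for example, a loudness measure), and breaks ties in favor of the first terminal; the type and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class WakeResponse:
    """Wake-up response information received from the second terminal (hypothetical)."""
    reject: bool
    wake_feature: Optional[float] = None


def choose_terminal(first_wake_feature: float, response: WakeResponse) -> str:
    """Return "first" or "second" as the terminal to be awakened."""
    # A rejection means only the first terminal is a candidate.
    if response.reject or response.wake_feature is None:
        return "first"
    # Assumption: a larger wake-up feature means the user is closer.
    if response.wake_feature > first_wake_feature:
        return "second"
    return "first"
```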
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
The present disclosure also provides a computer readable storage medium having stored thereon computer program instructions which, when executed by a processor, implement the steps of the wake-up control method provided by the present disclosure.
Fig. 10 is a block diagram illustrating a terminal 1000 in accordance with an exemplary embodiment. For example, terminal 1000 can be a mobile telephone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, an exercise device, a personal digital assistant, and the like.
Referring to FIG. 10, terminal 1000 can include one or more of the following components: processing component 1002, memory 1004, power component 1006, multimedia component 1008, audio component 1010, input/output (I/O) interface 1012, sensor component 1014, and communications component 1016.
Processing component 1002 generally controls overall operation of terminal 1000, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing component 1002 may include one or more processors 1020 to execute instructions to perform all or some of the steps of the wake-up control method described above. Further, processing component 1002 may include one or more modules that facilitate interaction between processing component 1002 and other components. For example, the processing component 1002 may include a multimedia module to facilitate interaction between the multimedia component 1008 and the processing component 1002.
Memory 1004 is configured to store various types of data to support operation at terminal 1000. Examples of such data include instructions for any application or method operating on terminal 1000, contact data, phonebook data, messages, pictures, videos, and the like. The memory 1004 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
Power components 1006 provide power to the various components of terminal 1000. Power components 1006 can include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for terminal 1000.
The multimedia component 1008 includes a screen that provides an output interface between the terminal 1000 and a user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 1008 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera can receive external multimedia data when the terminal 1000 is in an operation mode, such as a photographing mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.
The audio component 1010 is configured to output and/or input audio signals. For example, audio component 1010 includes a Microphone (MIC) configured to receive external audio signals when terminal 1000 is in an operational mode, such as call mode, record mode, and voice recognition mode. The received audio signal may further be stored in the memory 1004 or transmitted via the communication component 1016. In some embodiments, audio component 1010 also includes a speaker for outputting audio signals.
I/O interface 1012 provides an interface between processing component 1002 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
Sensor assembly 1014 includes one or more sensors for providing various aspects of status assessment for terminal 1000. For example, sensor assembly 1014 can detect an open/closed state of terminal 1000 and the relative positioning of components, such as a display and keypad of terminal 1000. Sensor assembly 1014 can also detect a change in position of terminal 1000 or a component of terminal 1000, the presence or absence of user contact with terminal 1000, the orientation or acceleration/deceleration of terminal 1000, and a change in temperature of terminal 1000. The sensor assembly 1014 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 1014 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 1014 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
Communications component 1016 is configured to facilitate communications between terminal 1000 and other devices in a wired or wireless manner. Terminal 1000 can access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 1016 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communications component 1016 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, Infrared Data Association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, terminal 1000 can be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the wake-up control methods described above.
In an exemplary embodiment, a non-transitory computer-readable storage medium comprising instructions, such as memory 1004, that are executable by processor 1020 of terminal 1000 to perform the wake-up control method described above is also provided. For example, the non-transitory computer readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
In another exemplary embodiment, a computer program product is also provided, which contains a computer program executable by a programmable apparatus, the computer program having code portions for performing the wake-up control method described above when executed by the programmable apparatus.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (20)

1. A wake-up control method, applied to a first terminal, the method comprising:
collecting audio data in real time;
under the condition that first audio data collected in a target time period meet a preset awakening condition, obtaining first historical audio data in a stored historical time period, wherein the historical time period comprises a preset time period before the target time period;
acquiring a first voiceprint feature corresponding to the first historical audio data, and sending the first voiceprint feature to a second terminal, so that the second terminal acquires stored second historical audio data in the historical time period after receiving the first voiceprint feature, and sends awakening response information to the first terminal according to the second historical audio data;
receiving the awakening response information sent by the second terminal;
determining a terminal to be awakened from the first terminal and the second terminal according to the awakening response information;
and controlling the terminal to be awakened to wake up.
2. The method of claim 1, wherein the preset wake-up condition comprises:
the first audio data comprise preset awakening words; or
the first audio data comprise target keywords in the preset awakening words, the target keywords comprise the first N keywords in the preset awakening words, N is an integer which is larger than 0 and smaller than a target number, and the target number is the number of the keywords in the preset awakening words.
3. The method of claim 1, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data; the determining, according to the wake-up response information, a terminal to be awakened from the first terminal and the second terminal comprises:
in a case where the wake-up response information comprises the wake-up feature of the second historical audio data, determining, from the first terminal and the second terminal, a target terminal closest to a user according to a wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and taking the target terminal as the terminal to be awakened, wherein the wake-up feature is used for representing the distance between a terminal and the user; or
in a case where the wake-up response information comprises the wake-up rejection information, taking the first terminal as the terminal to be awakened.
4. The method according to claim 1, wherein the controlling the terminal to be woken up to wake up comprises:
controlling the first terminal to wake up in a case where the terminal to be woken up is the first terminal; or
sending a wake-up instruction to the second terminal in a case where the terminal to be woken up is the second terminal, so that the second terminal wakes up according to the wake-up instruction.
5. A wake-up control method, applied to a second terminal, the method comprising:
collecting audio data in real time;
under the condition that a first voiceprint characteristic corresponding to first historical audio data collected in a historical time period and sent by a first terminal is received, second historical audio data in the historical time period are obtained;
and sending awakening response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be awakened according to the awakening response information and controls the terminal to be awakened to wake up.
6. The method of claim 5, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data, and wherein before the sending of the wake-up response information to the first terminal according to the second historical audio data, the method further comprises:
determining whether to wake up the second terminal according to the second historical audio data;
the sending of the wake-up response information to the first terminal according to the second historical audio data comprises:
sending the wake-up feature of the second historical audio data to the first terminal in a case where it is determined to wake up the second terminal; or
sending the wake-up rejection information to the first terminal in a case where it is determined not to wake up the second terminal.
7. The method according to claim 6, wherein before said determining whether to wake up the second terminal according to the second historical audio data, the method further comprises:
determining whether the second historical audio data comprises voice data;
the determining whether to wake up the second terminal according to the second historical audio data includes:
in a case where the second historical audio data includes the voice data, acquiring a second voiceprint feature corresponding to the second historical audio data, and determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or
determining not to wake up the second terminal if the second historical audio data does not include the voice data.
8. The method of claim 7, wherein the determining whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature comprises:
determining whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature;
and when the first historical audio data and the second historical audio data belong to the same user, determining to awaken the second terminal, or when the first historical audio data and the second historical audio data do not belong to the same user, determining not to awaken the second terminal.
9. The method of claim 5, further comprising:
receiving a wake-up instruction sent by the first terminal under the condition that the terminal to be woken up is the second terminal;
and awakening the second terminal according to the awakening instruction.
10. A wake-up control apparatus, applied to a first terminal, the apparatus comprising:
a first data collection module configured to collect audio data in real time;
the first data acquisition module is configured to acquire first historical audio data in a stored historical time period under the condition that first audio data acquired in a target time period meet a preset awakening condition, wherein the historical time period comprises a preset time period before the target time period;
the voiceprint feature acquisition module is configured to acquire a first voiceprint feature corresponding to the first historical audio data and send the first voiceprint feature to a second terminal, so that the second terminal acquires stored second historical audio data in the historical time period after receiving the first voiceprint feature and sends awakening response information to the first terminal according to the second historical audio data;
an information receiving module configured to receive the wake-up response information sent by the second terminal;
a terminal determining module configured to determine a terminal to be awakened from the first terminal and the second terminal according to the awakening response information;
and a first wake-up module configured to control the terminal to be awakened to wake up.
11. The apparatus of claim 10, wherein the preset wake-up condition comprises:
the first audio data comprise preset awakening words; or
the first audio data comprise target keywords in the preset awakening words, the target keywords comprise the first N keywords in the preset awakening words, N is an integer which is larger than 0 and smaller than a target number, and the target number is the number of the keywords in the preset awakening words.
12. The apparatus of claim 10, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data; the terminal determination module comprises:
a terminal determination submodule configured to, in a case where the wake-up response information comprises the wake-up feature of the second historical audio data, determine, from the first terminal and the second terminal, a target terminal closest to a user according to a wake-up feature of the first audio data and the wake-up feature of the second historical audio data, and take the target terminal as the terminal to be awakened, wherein the wake-up feature is used for representing the distance between a terminal and the user; or
in a case where the wake-up response information comprises the wake-up rejection information, take the first terminal as the terminal to be awakened.
13. The apparatus of claim 10, wherein the first wake-up module comprises:
a wake-up sub-module configured to control the first terminal to wake up in a case where the terminal to be woken up is the first terminal; or
send a wake-up instruction to the second terminal in a case where the terminal to be woken up is the second terminal, so that the second terminal wakes up according to the wake-up instruction.
14. A wake-up control apparatus, for application to a second terminal, the apparatus comprising:
a second data collection module configured to collect audio data in real time;
the second data acquisition module is configured to acquire stored second historical audio data in a historical time period when a first voiceprint feature corresponding to first historical audio data acquired in the historical time period and sent by a first terminal is received;
and an information sending module configured to send wake-up response information to the first terminal according to the second historical audio data, so that the first terminal determines a terminal to be awakened according to the wake-up response information and controls the terminal to be awakened to wake up.
15. The apparatus of claim 14, wherein the wake-up response information comprises wake-up rejection information or a wake-up feature of the second historical audio data, the apparatus further comprising:
a wake-up determination module configured to determine whether to wake up the second terminal according to the second historical audio data;
the information sending module is configured to:
send the wake-up feature of the second historical audio data to the first terminal in a case where it is determined to wake up the second terminal; or
send the wake-up rejection information to the first terminal in a case where it is determined not to wake up the second terminal.
16. The apparatus of claim 15, further comprising:
a data determination module configured to determine whether the second historical audio data comprises voice data;
the wake determination module configured to:
in a case where the second historical audio data includes the voice data, acquire a second voiceprint feature corresponding to the second historical audio data, and determine whether to wake up the second terminal according to the first voiceprint feature and the second voiceprint feature; or
determining not to wake up the second terminal if the second historical audio data does not include the voice data.
17. The apparatus of claim 16, wherein the wake determination module comprises:
the user judgment sub-module is configured to determine whether the first historical audio data and the second historical audio data belong to the same user according to the first voiceprint feature and the second voiceprint feature;
a wake-up determination sub-module configured to determine to wake up the second terminal in a case where it is determined that the first and second historical audio data belong to the same user, or determine not to wake up the second terminal in a case where it is determined that the first and second historical audio data do not belong to the same user.
18. The apparatus of claim 14, further comprising:
the instruction receiving module is configured to receive a wake-up instruction sent by the first terminal under the condition that the terminal to be woken up is the second terminal;
and the second awakening module is configured to awaken the second terminal according to the awakening instruction.
19. A computer readable storage medium having stored thereon computer program instructions, characterized in that the program instructions, when executed by a processor, implement the steps of the method of any of claims 1-4, or the program instructions, when executed by a processor, implement the steps of the method of any of claims 5-9.
20. A terminal, comprising:
a memory having a computer program stored thereon;
a processor for executing the computer program in the memory to carry out the steps of the method of any one of claims 1 to 4 or to carry out the steps of the method of any one of claims 5 to 9.
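As an illustration of the preset wake-up condition recited in claims 2 and 11, the sketch below checks whether a sequence of recognized keywords contains either the whole preset wake-up word or its first N keywords. It assumes the audio has already been converted into keyword tokens by some recognizer; the function names and the token representation are hypothetical.

```python
from typing import Sequence


def _contains_run(seq: Sequence[str], run: Sequence[str]) -> bool:
    """True if `run` occurs as a contiguous run inside `seq`."""
    k = len(run)
    return any(list(seq[i:i + k]) == list(run) for i in range(len(seq) - k + 1))


def meets_wake_condition(recognized: Sequence[str],
                         wake_keywords: Sequence[str], n: int) -> bool:
    """Check for the full wake-up word or its first n keywords."""
    total = len(wake_keywords)
    # N must be greater than 0 and smaller than the number of keywords.
    if not 0 < n < total:
        raise ValueError("n must satisfy 0 < n < number of keywords")
    return (_contains_run(recognized, wake_keywords)
            or _contains_run(recognized, wake_keywords[:n]))
```

Matching on the first N keywords lets the first terminal start the exchange with the second terminal before the user has finished speaking the whole wake-up word.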
CN202011349897.0A 2020-11-26 2020-11-26 Wake-up control method and device, storage medium and terminal Pending CN112489650A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011349897.0A CN112489650A (en) 2020-11-26 2020-11-26 Wake-up control method and device, storage medium and terminal

Publications (1)

Publication Number Publication Date
CN112489650A true CN112489650A (en) 2021-03-12

Family

ID=74935022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011349897.0A Pending CN112489650A (en) 2020-11-26 2020-11-26 Wake-up control method and device, storage medium and terminal

Country Status (1)

Country Link
CN (1) CN112489650A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170345422A1 (en) * 2016-05-24 2017-11-30 Samsung Electronics Co., Ltd. Electronic devices having speech recognition functionality and operating methods of electronic devices
CN108259280A (en) * 2018-02-06 2018-07-06 北京语智科技有限公司 A kind of implementation method, the system of Inteldectualization Indoors control
CN109391528A (en) * 2018-08-31 2019-02-26 百度在线网络技术(北京)有限公司 Awakening method, device, equipment and the storage medium of speech-sound intelligent equipment
US20190214002A1 (en) * 2018-01-09 2019-07-11 Lg Electronics Inc. Electronic device and method of controlling the same
US20190311720A1 (en) * 2018-04-09 2019-10-10 Amazon Technologies, Inc. Device arbitration by multiple speech processing systems
CN110364153A (en) * 2019-07-30 2019-10-22 恒大智慧科技有限公司 A kind of distributed sound control method, system, computer equipment and storage medium
WO2020019176A1 (en) * 2018-07-24 2020-01-30 华为技术有限公司 Method for updating wake-up voice of voice assistant by terminal, and terminal
CN111210829A (en) * 2020-02-19 2020-05-29 腾讯科技(深圳)有限公司 Speech recognition method, apparatus, system, device and computer readable storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115827075A (en) * 2022-11-21 2023-03-21 小米汽车科技有限公司 Device control method, device and storage medium
CN115827075B (en) * 2022-11-21 2024-02-23 小米汽车科技有限公司 Equipment control method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination