CN115985323A - Voice wake-up method and device, electronic equipment and readable storage medium - Google Patents

Publication number
CN115985323A
Authority
CN
China
Prior art keywords
signal value
sample
signal
calculating
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310273455.XA
Other languages
Chinese (zh)
Other versions
CN115985323B (en
Inventor
鲁勇
丁萌
刘波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Intengine Technology Co Ltd
Original Assignee
Beijing Intengine Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Intengine Technology Co Ltd filed Critical Beijing Intengine Technology Co Ltd
Priority to CN202310273455.XA priority Critical patent/CN115985323B/en
Publication of CN115985323A publication Critical patent/CN115985323A/en
Application granted granted Critical
Publication of CN115985323B publication Critical patent/CN115985323B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00: Reducing energy consumption in communication networks
    • Y02D30/70: Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Telephone Function (AREA)

Abstract

The application discloses a voice wake-up method, a voice wake-up apparatus, an electronic device, and a readable storage medium. The voice wake-up method comprises the following steps: collecting a sample signal within a preset time length; counting the signal value corresponding to each sample frame in the sample signal; calculating a background signal value corresponding to the sample signal based on the counted signal values; and, when a voice wake-up operation triggered for a target device is detected, waking up the target device according to the background signal value. The voice wake-up scheme provided by the application does not need to consume a large amount of the device's computing power for long-term standby wake-up, and thus avoids heating of the device and the resulting reduction in its service life.

Description

Voice wake-up method and device, electronic equipment and readable storage medium
Technical Field
The present application relates to the field of communications, and in particular, to a voice wake-up method and apparatus, an electronic device, and a readable storage medium.
Background
With the advent of the mobile internet and the artificial intelligence era, voice interaction has grown at an unprecedented pace in recent years. Voice wake-up technology, as a special kind of voice recognition technology, has become an important component of the interaction between users and machines. The goal of a voice wake-up system is to wake up a device without manual operation.
Current voice wake-up schemes usually rely on an inertial filter or on a neural network model. Under both schemes, however, long-term standby wake-up consumes a large amount of the device's computing power, which may cause the device to heat up and thereby shorten its service life.
Disclosure of Invention
In view of the above technical problems, the present application provides a voice wake-up method, apparatus, electronic device, and readable storage medium, which do not need to consume a large amount of computing power of the device to perform long-term standby wake-up, and avoid a situation that the service life of the device is reduced due to heating of the device.
In order to solve the above technical problem, the present application provides a voice wake-up method, including:
collecting a sample signal within a preset time length;
counting signal values corresponding to each frame of sample frames in the sample signals;
calculating a background signal value corresponding to the sample signal based on the statistical signal value;
and when a voice wake-up operation triggered for a target device is detected, waking up the target device according to the background signal value.
Optionally, in some embodiments of the present application, the calculating a background signal value corresponding to the sample signal based on the statistical signal value includes:
acquiring a historical signal value in historical time;
and calculating a background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.
Optionally, in some embodiments of the application, the calculating a background signal value corresponding to the sample signal according to a fluctuation between the historical signal value and a statistical signal value includes:
determining an initial signal value among the counted signal values;
calculating a difference between the initial signal value and the historical signal value;
adjusting the historical signal value according to the difference value to obtain an adjusted signal value;
and adjusting the adjusted signal value according to the fluctuation among other signal values except the initial signal value to obtain a background signal value corresponding to the sample signal.
Optionally, in some embodiments of the application, the adjusting the historical signal value according to the difference to obtain an adjusted signal value includes:
when the difference value is detected to be larger than a threshold value, calculating the sum of the historical signal value and a preset value to obtain an adjusted signal value;
and when the difference value is detected to be smaller than the threshold value, calculating the difference between the historical signal value and the preset value to obtain an adjusted signal value.
Optionally, in some embodiments of the present application, the waking up the target device according to the background signal value when the voice wake-up operation triggered for the target device is detected includes:
when a voice wake-up operation triggered for the target device is detected, acquiring an operation signal value corresponding to the voice wake-up operation;
detecting whether the operation signal value is greater than the background signal value;
when the operation signal value is detected to be larger than the background signal value, the target device is awakened.
Optionally, in some embodiments of the present application, the method further includes:
periodically updating the background signal value to obtain an updated background signal value;
the waking up the target device according to the background signal value when the voice wake-up operation triggered for the target device is detected includes: when the voice wake-up operation triggered for the target device is detected, waking up the target device according to the updated background signal value.
Correspondingly, the present application further provides a voice wake-up apparatus, including:
the acquisition module is used for acquiring a sample signal within a preset time length;
the statistical module is used for counting the signal value corresponding to each frame of sample frame in the sample signal;
the calculating module is used for calculating a background signal value corresponding to the sample signal based on the statistical signal value;
and the wake-up module is used for waking up the target device according to the background signal value when a voice wake-up operation triggered for the target device is detected.
Optionally, in some embodiments of the application, the calculation module includes:
an acquisition unit configured to acquire a history signal value in a history time;
and the calculating unit is used for calculating a background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.
The present application further provides an electronic device comprising a memory and a processor, wherein the memory stores a computer program, and the processor implements the steps of the method when executing the computer program.
The present application also provides a computer storage medium having a computer program stored thereon, which, when being executed by a processor, carries out the steps of the method as described above.
As described above, the present application provides a voice wake-up method, an apparatus, an electronic device, and a readable storage medium, where the voice wake-up method includes: collecting a sample signal within a preset time length; counting the signal value corresponding to each sample frame in the sample signal; calculating a background signal value corresponding to the sample signal based on the counted signal values; and, when a voice wake-up operation triggered for a target device is detected, waking up the target device according to the background signal value. In the voice wake-up scheme provided by the application, the signal value corresponding to each sample frame in the sample signal is used to calculate the background signal value corresponding to the sample signal, and the calculated background signal value is used to wake up the target device. Voice wake-up therefore does not need to rely on an inertial filter or a neural network model, a large amount of the device's computing power does not need to be consumed for long-term standby wake-up, and the reduction in service life caused by heating of the device is avoided.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and, together with the description, serve to explain the principles of the application. In order to illustrate the technical solutions of the embodiments of the present application more clearly, the drawings used in the description of the embodiments are briefly described below. Those skilled in the art can obtain other drawings from these drawings without inventive effort.
Fig. 1 is a schematic structural diagram of a voice wake-up system according to an embodiment of the present application;
Fig. 2 is a schematic flowchart of a voice wake-up method according to an embodiment of the present application;
Fig. 3 is a schematic structural diagram of a voice wake-up apparatus according to an embodiment of the present application;
Fig. 4 is another schematic structural diagram of a voice wake-up apparatus according to an embodiment of the present application;
Fig. 5 is a schematic structural diagram of an intelligent terminal provided in an embodiment of the present application.
The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings. With the above figures, there are shown specific embodiments of the present application, which will be described in more detail below. These drawings and written description are not intended to limit the scope of the inventive concepts in any manner, but rather to illustrate the inventive concepts to those skilled in the art by reference to specific embodiments.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present application, as detailed in the appended claims.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element. Elements having the same designation may or may not have the same meaning in different embodiments of the application, the particular meaning being determined by its interpretation in the particular embodiment or by further reference to the context of the particular embodiment.
It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
In the following description, suffixes such as "module", "component", or "unit" used to indicate elements are used only for facilitating the description of the present application, and have no particular meaning in themselves. Thus, "module", "component" or "unit" may be used mixedly.
The following embodiments related to the present application are specifically described, and it should be noted that the order of description of the embodiments in the present application is not limited to the order of priority of the embodiments.
The embodiment of the application provides a voice awakening method and device, a storage medium and electronic equipment. Specifically, the voice wake-up method according to the embodiment of the present application may be executed by an electronic device, where the electronic device may be a terminal. The electronic device may be an electronic device such as a smart phone, a tablet computer, a notebook computer, a touch screen, a game machine, a Personal Computer (PC), a Personal Digital Assistant (PDA), and the like, and the electronic device may further include a client, which may be a voice wake-up client or other clients. The electronic device can be connected with the server in a wired or wireless manner, the server can be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, and a cloud server providing basic cloud computing services such as cloud service, a cloud database, cloud computing, cloud functions, cloud storage, network service, cloud communication, middleware service, domain name service, security service, CDN, a big data and artificial intelligence platform and the like.
For example, when the voice wake-up method is implemented in an electronic device, the electronic device collects a sample signal within a preset time period and counts the signal value corresponding to each sample frame of the sample signal. The electronic device then calculates a background signal value corresponding to the sample signal based on the counted signal values and, when a voice wake-up operation triggered for a target device is detected, wakes up the target device according to the background signal value.
Referring to fig. 1, fig. 1 is a system schematic diagram of a voice wake-up apparatus according to an embodiment of the present disclosure. The system may include at least one electronic device 1000, at least one server or personal computer 2000. The electronic device 1000 held by the user can be connected to different servers or personal computers through a network. The electronic device 1000 may be an electronic device having computing hardware capable of supporting and executing software products corresponding to multimedia. Additionally, the electronic device 1000 may also have one or more multi-touch sensitive screens for sensing and obtaining input by a user through touch or slide operations performed at multiple points of the one or more touch sensitive display screens. In addition, the electronic apparatus 1000 may be interconnected with a server or a personal computer 2000 through a network. The network may be a wireless network or a wired network, such as a Wireless Local Area Network (WLAN), a Local Area Network (LAN), a cellular network, a 2G network, a 3G network, a 4G network, a 5G network, etc. In addition, different electronic devices 1000 may also be connected to other embedded platforms or to servers, personal computers, and the like using their own bluetooth networks or hotspot networks. The server may be an independent physical server, a server cluster or a distributed system formed by a plurality of physical servers, or a cloud server providing basic cloud computing services such as cloud service, a cloud database, cloud computing, a cloud function, cloud storage, network service, cloud communication, middleware service, domain name service, security service, CDN, and a big data and artificial intelligence platform.
The embodiment of the application provides a voice wake-up method, which can be executed by an electronic device. The electronic equipment comprises a touch display screen and a processor, wherein the touch display screen is used for presenting a graphical user interface and receiving an operation instruction generated by a user acting on the graphical user interface. When a user operates the graphical user interface through the touch display screen, the graphical user interface can control the local content of the electronic equipment through responding to the received operation instruction, and can also control the content of the server end through responding to the received operation instruction. For example, the operation instructions generated by the user acting on the graphical user interface include instructions for processing the initial audio data, and the processor is configured to launch the corresponding application program after receiving the instructions provided by the user. Further, the processor is configured to render and draw a graphical user interface associated with the application on the touch-sensitive display screen. A touch display screen is a multi-touch sensitive screen capable of sensing a touch or slide operation performed simultaneously at a plurality of points on the screen. The user uses fingers to perform touch operation on the graphical user interface, and the graphical user interface controls the corresponding operation displayed in the graphical user interface of the application when the touch operation is detected.
In the voice wake-up scheme of the present application, the signal value corresponding to each sample frame in the sample signal is used to calculate the background signal value corresponding to the sample signal, and the calculated background signal value is used to wake up the target device. Voice wake-up therefore does not need to rely on an inertial filter or a neural network model, a large amount of the device's computing power does not need to be consumed for long-term standby wake-up, and the reduction in service life caused by heating of the device is avoided.
Details are given below. It should be noted that the order in which the following embodiments are described is not intended to limit the order of priority of the embodiments.
A voice wake-up method, comprising: collecting a sample signal within a preset time length; counting the signal value corresponding to each sample frame in the sample signal; calculating a background signal value corresponding to the sample signal based on the counted signal values; and, when a voice wake-up operation triggered for a target device is detected, waking up the target device according to the background signal value.
Referring to fig. 2, fig. 2 is a flowchart illustrating a voice wake-up method according to an embodiment of the present application. The specific flow of the voice wake-up method may be as follows:
101. Collect a sample signal within a preset time length.
The sample signal is an audio signal acquired within a preset time duration, the sample signal may include a human voice signal, an environmental sound signal and other types of sound signals, the sound signal may be acquired by a sound sensor (such as a microphone) built in the electronic device, the preset time duration may be 10 minutes, 20 minutes or 100 minutes, or 50 seconds, 120 seconds or 300 seconds, and may be specifically set according to actual conditions, which is not described herein again.
102. Count the signal value corresponding to each sample frame in the sample signal.
For example, the sample signal may be divided into frames to obtain a plurality of sample frames corresponding to the sample signal. Because the sample signal may contain a human voice signal (i.e., a speech signal), and a speech signal is unstable over long spans but stable over short ones (it can be regarded as approximately constant within 10-30 ms), the long sample signal is divided into short segments to facilitate subsequent voice wake-up. Each short segment is a sample frame of the present application, within which the characteristics of the speech signal can be regarded as stable. The framing principle is that a frame must be short enough for the signal within it to be stable: the frame length should be less than the length of a phoneme, and a phoneme lasts about 50 ms at a normal speaking rate. In addition, a frame to be subjected to Fourier analysis must contain enough vibration periods. Male voices are around 100 Hz and female voices around 200 Hz, which correspond to periods of 10 ms and 5 ms, respectively. The length of each sample frame is therefore between 10 ms and 40 ms and may be selected according to the actual situation.
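The framing described above can be sketched as follows. This is a minimal illustration rather than the implementation of the application: the function name `split_into_frames`, the parameters `sample_rate_hz` and `frame_ms`, the use of non-overlapping frames, and the dropping of a trailing partial frame are all assumptions; the text only states that a frame is between 10 ms and 40 ms long.

```python
def split_into_frames(samples, sample_rate_hz, frame_ms=20):
    """Split a sequence of audio samples into consecutive sample frames.

    Assumptions (not stated in the source): frames do not overlap and a
    trailing partial frame is discarded.
    """
    if not 10 <= frame_ms <= 40:
        raise ValueError("frame length is expected to lie within 10-40 ms")
    # Number of samples that make up one frame of the requested duration.
    frame_len = int(sample_rate_hz * frame_ms / 1000)
    if frame_len <= 0:
        raise ValueError("sample rate too low for the requested frame length")
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]
```

With a 16 kHz sample rate and the default 20 ms frame, each frame holds 320 samples.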
Further, a discrete Fourier transform (applied frame by frame, this is also called a short-time Fourier transform) may be applied to each frame to obtain the frequency-energy distribution of the signal within that frame. The frequency-domain images of the frames are then concatenated, with the horizontal axis representing frequency and the vertical axis representing amplitude, to obtain a spectrogram of the sample signal. In the present application, the amplitude is taken as the signal value corresponding to the sample frame.
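As a rough sketch of this step, the code below computes the magnitude spectrum of one frame with a direct discrete Fourier transform and reduces it to a single number. The text does not say how the amplitude spectrum is reduced to one signal value per frame; taking the largest bin magnitude is an assumption made here for illustration, and the names `dft_magnitudes` and `frame_signal_value` are hypothetical. The direct O(n^2) transform is used only to keep the example dependency-free.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Return the DFT magnitudes of a real-valued frame for bins 0..n/2."""
    n = len(frame)
    magnitudes = []
    for k in range(n // 2 + 1):
        # Direct evaluation of the k-th DFT coefficient.
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(frame))
        magnitudes.append(abs(coeff))
    return magnitudes


def frame_signal_value(frame):
    """Signal value of a frame: the peak spectral magnitude (an assumption)."""
    return max(dft_magnitudes(frame))
```

An FFT routine would normally replace the direct sum when frames are long or the device is constrained.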
103. Calculate a background signal value corresponding to the sample signal based on the counted signal values.
For example, a reference value may be obtained, the fluctuation between the counted signal values may be determined, and the background signal value corresponding to the sample signal may be calculated based on the difference between the fluctuation and the reference value. That is, optionally, in some embodiments, the step "calculating a background signal value corresponding to the sample signal based on the statistical signal values" may specifically include:
(11) Acquiring a historical signal value in historical time;
(12) Calculating a background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.
For example, an initial signal value may be determined among the counted signal values, such as by taking the signal value of the first sample frame of the sample signal as the initial signal value. The difference between the initial signal value and the historical signal value is then calculated, and the background signal value corresponding to the sample signal is calculated based on this difference and the fluctuation between the counted signal values. That is, optionally, in some embodiments, the step "calculating the background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value" may specifically include:
(21) Determining an initial signal value among the counted signal values;
(22) Calculating a difference between the initial signal value and the historical signal value;
(23) Adjusting the historical signal value according to the difference value to obtain an adjusted signal value;
(24) Adjusting the adjusted signal value according to the fluctuation among the signal values other than the initial signal value to obtain a background signal value corresponding to the sample signal.
For example, after the signal value of the first sample frame of the sample signal is determined as the initial signal value, a historical signal value is obtained. The historical signal value may be the average of the signal values acquired within a historical time period and may be used as the environmental background noise. The difference between the initial signal value and the historical signal value is calculated, and the historical signal value is adjusted based on this difference. Next, the difference between the signal value of the frame following the initial frame and the adjusted historical signal value is calculated, and the adjusted historical signal value is adjusted again based on that difference. This continues until all sample frames in the sample signal have been processed. Finally, the average of all the adjusted historical signal values is calculated to obtain the background signal value corresponding to the sample signal.
It should be noted that, in the present application, the historical signal value is adjusted using an add/subtract-1 unit: when the value is greater than the set value, 1 is added; when it is equal to the set value, no adjustment is made; and when it is less than the set value, 1 is subtracted. That is, optionally, in some embodiments, the step "adjusting the historical signal value according to the difference value to obtain an adjusted signal value" may specifically include:
(31) When the difference value is detected to be larger than the threshold value, calculating the sum of the historical signal value and a preset value to obtain an adjusted signal value;
(32) When the difference value is detected to be smaller than the threshold value, calculating the difference between the historical signal value and the preset value to obtain an adjusted signal value.
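The adjustment rule and the frame-by-frame procedure described above might be sketched as below. Several points are assumptions rather than statements from the text: the difference is taken as the frame's signal value minus the current historical value, the threshold defaults to 0, the preset value defaults to 1 (matching the add/subtract-1 description), and the names `adjust` and `background_value` are hypothetical.

```python
def adjust(history, signal_value, threshold=0.0, step=1.0):
    """Move the historical value one step toward the observed signal value."""
    diff = signal_value - history  # assumed sign convention
    if diff > threshold:
        return history + step
    if diff < threshold:
        return history - step
    return history  # difference equals the threshold: no adjustment


def background_value(signal_values, history, threshold=0.0, step=1.0):
    """Average of the adjusted historical values over all sample frames."""
    if not signal_values:
        return history
    adjusted = []
    current = history
    for value in signal_values:
        # Each frame is compared with the most recently adjusted value.
        current = adjust(current, value, threshold, step)
        adjusted.append(current)
    return sum(adjusted) / len(adjusted)
```

For instance, with a historical value of 10 and frame values 12, 12, 12, 5, the adjusted values are 11, 12, 12, 11, giving a background value of 11.5. Each frame costs only one comparison and one addition, which is in line with the low-compute goal stated in the application.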
104. When a voice wake-up operation triggered for the target device is detected, wake up the target device according to the background signal value.
For example, when a voice wake-up operation triggered for the target device is detected, an operation signal value corresponding to the voice wake-up operation is obtained, and whether to wake up the target device is decided by comparing the operation signal value with the background signal value. That is, optionally, in some embodiments, the step "when the voice wake-up operation triggered for the target device is detected, waking up the target device according to the background signal value" may specifically include:
(41) When a voice wake-up operation triggered for the target device is detected, acquiring an operation signal value corresponding to the voice wake-up operation;
(42) Detecting whether the operation signal value is larger than the background signal value;
(43) When the operation signal value is detected to be larger than the background signal value, waking up the target device.
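Steps (41) to (43) reduce to a single comparison, sketched below. The `TargetDevice` class and the `handle_wake_operation` function are hypothetical stand-ins introduced only for illustration; how the operation signal value is measured is outside this sketch.

```python
from dataclasses import dataclass


@dataclass
class TargetDevice:
    """Hypothetical stand-in for the device being woken up."""
    awake: bool = False


def handle_wake_operation(device, operation_signal_value, background_signal_value):
    """Wake the device only if the operation signal exceeds the background."""
    if operation_signal_value > background_signal_value:
        device.awake = True
    return device.awake
```

A signal value equal to the background value does not wake the device, since the text requires the operation signal value to be larger.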
For example, when the chip of the target device is powered on, a sound signal is collected, and an average value is computed over a long period and used as the environmental background noise (i.e., the historical signal value) of the application scenario. The target device then continuously collects sample signals and calculates the signal value corresponding to each sample frame. Next, the target device calculates the difference between the signal value and the historical signal value and continuously updates the background signal value. When a voice wake-up operation triggered for the target device is detected, the target device is woken up according to the background signal value. That is, in some embodiments, the voice wake-up method of the present application may further include periodically updating the background signal value to obtain an updated background signal value, and the step "when the voice wake-up operation triggered for the target device is detected, waking up the target device according to the background signal value" may specifically include: when the voice wake-up operation triggered for the target device is detected, waking up the target device according to the updated background signal value.
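One way the periodic update could look is sketched below. The text does not specify the update period or how the update is triggered, so refreshing the published background value once every `period` frames is an assumption, as are the class name `PeriodicBackground`, the threshold of 0, and the default step of 1.

```python
class PeriodicBackground:
    """Tracks per-frame signal values and refreshes the background periodically."""

    def __init__(self, history, period, step=1.0):
        if period <= 0:
            raise ValueError("period must be a positive number of frames")
        self.background = history  # value used for wake-up decisions
        self._tracked = history    # running adjusted historical value
        self._period = period
        self._step = step
        self._pending = []         # adjusted values since the last refresh

    def feed(self, signal_value):
        """Consume one frame's signal value; return True if a refresh occurred."""
        if signal_value > self._tracked:
            self._tracked += self._step
        elif signal_value < self._tracked:
            self._tracked -= self._step
        self._pending.append(self._tracked)
        if len(self._pending) < self._period:
            return False
        # Publish the average of the adjusted values gathered in this period.
        self.background = sum(self._pending) / len(self._pending)
        self._pending.clear()
        return True
```

A wake-up decision made between refreshes would read `background`, which always holds the most recently updated value.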
This completes the voice wake-up procedure of the present application.
According to the voice wake-up method, a sample signal is collected within a preset time length, the signal value corresponding to each sample frame in the sample signal is counted, a background signal value corresponding to the sample signal is calculated based on the counted signal values, and, when a voice wake-up operation triggered for a target device is detected, the target device is woken up according to the background signal value.
In order to better implement the voice wake-up method, the application also provides a voice wake-up apparatus based on the voice wake-up method. The terms used have the same meanings as in the voice wake-up method, and implementation details may be found in the description of the method embodiment.
Referring to fig. 3, fig. 3 is a schematic structural diagram of a voice wake-up apparatus provided in the present application, where the voice wake-up apparatus may include a collecting module 201, a counting module 202, a calculating module 203, and a wake-up module 204, which may specifically be as follows:
The acquisition module 201 is configured to acquire a sample signal within a preset time period.
The sample signal is an audio signal acquired within a preset time duration, the sample signal may include a human voice signal, an environmental sound signal and other types of sound signals, the sound signal may be acquired by a sound sensor (such as a microphone) built in the electronic device, the preset time duration may be 10 minutes, 20 minutes or 100 minutes, or 50 seconds, 120 seconds or 300 seconds, and may be specifically set according to actual conditions, which is not described herein again.
The counting module 202 is configured to count the signal value corresponding to each sample frame in the sample signal.
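The per-frame counting step can be sketched as follows. The text does not define what the "signal value" of a frame is, so this sketch assumes it is the mean absolute amplitude of the frame; the function names and the non-overlapping framing are likewise assumptions made for illustration.

```python
def split_frames(samples, frame_len):
    """Split a sample sequence into consecutive, non-overlapping frames."""
    if frame_len <= 0:
        raise ValueError("frame_len must be positive")
    # Drop a trailing partial frame so that every frame has the same length.
    usable = len(samples) - len(samples) % frame_len
    return [samples[i:i + frame_len] for i in range(0, usable, frame_len)]


def frame_signal_value(frame):
    """Mean absolute amplitude of one frame (assumed definition of 'signal value')."""
    return sum(abs(s) for s in frame) / len(frame)


def count_signal_values(samples, frame_len):
    """Return one signal value per sample frame."""
    return [frame_signal_value(f) for f in split_frames(samples, frame_len)]
```

A root-mean-square energy or a log-energy measure could be substituted for `frame_signal_value` without changing the rest of the flow.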
The calculating module 203 is configured to calculate a background signal value corresponding to the sample signal based on the counted signal values.
For example, a reference value may be obtained, the fluctuation among the counted signal values may be determined, and the background signal value corresponding to the sample signal may be calculated based on the difference between the fluctuation and the reference value. That is, optionally, in some embodiments, the calculating module 203 may specifically include:
an acquisition unit configured to acquire a historical signal value in a historical time;
and a calculating unit configured to calculate a background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the counted signal values.
Optionally, in some embodiments, the calculating unit may specifically include:
a determination subunit configured to determine an initial signal value among the counted signal values;
a calculating subunit configured to calculate a difference between the initial signal value and the historical signal value;
an adjusting subunit configured to adjust the historical signal value according to the difference to obtain an adjusted signal value;
the adjusting subunit being further configured to adjust the adjusted signal value according to the fluctuation among the signal values other than the initial signal value, to obtain the background signal value corresponding to the sample signal.
Optionally, in some embodiments, the adjusting subunit may specifically be configured to: when the difference value is detected to be larger than the threshold value, calculating the sum of the historical signal value and a preset value to obtain an adjusted signal value; and when the difference value is smaller than the threshold value, calculating the difference between the historical signal value and a preset value to obtain an adjusted signal value.
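The adjustment rule above can be sketched as a small step function. Several points are assumptions rather than statements from the text: the initial signal value is taken to be the first counted value, the "fluctuation among the other signal values" is handled by applying the same step rule to each remaining value in turn, and a difference exactly equal to the threshold leaves the value unchanged (the text does not cover that case).

```python
def adjust(reference, value, threshold, preset):
    """Step the reference up or down by `preset` depending on the difference."""
    diff = value - reference
    if diff > threshold:
        return reference + preset  # difference larger than threshold: add preset
    if diff < threshold:
        return reference - preset  # difference smaller than threshold: subtract preset
    return reference  # equal case is unspecified in the text; assumed unchanged


def background_signal_value(values, historical, threshold, preset):
    """Derive a background value from counted signal values and a historical value."""
    if not values:
        raise ValueError("at least one signal value is required")
    # Assumption: the initial signal value is the first counted value.
    adjusted = adjust(historical, values[0], threshold, preset)
    # Assumption: each remaining value nudges the adjusted value by the same rule.
    for value in values[1:]:
        adjusted = adjust(adjusted, value, threshold, preset)
    return adjusted
```

Because each frame moves the estimate by at most `preset`, a short loud event shifts the background value only slightly, while a sustained change in ambient level is followed gradually.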
The wake-up module 204 is configured to wake up the target device according to the background signal value when a voice wake-up operation triggered for the target device is detected.
For example, when a voice wake-up operation triggered for the target device is detected, an operation signal value corresponding to the voice wake-up operation is obtained, and whether to wake up the target device is decided by comparing the operation signal value with the background signal value. Optionally, in some embodiments, the wake-up module 204 may be specifically configured to: when a voice wake-up operation triggered for the target device is detected, acquire an operation signal value corresponding to the voice wake-up operation; detect whether the operation signal value is larger than the background signal value; and when the operation signal value is detected to be larger than the background signal value, wake up the target device.
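The comparison performed by the wake-up module can be sketched as below. The helper names and the `wake` callback are hypothetical, and the operation signal value is assumed to be the mean absolute amplitude of the frame that carries the wake-up operation; the text only requires that the operation signal value exceed the background signal value.

```python
def operation_signal_value(frame):
    """Assumed measure: mean absolute amplitude of the wake-up operation's frame."""
    return sum(abs(s) for s in frame) / len(frame)


def handle_wake_operation(frame, background_value, wake):
    """Call `wake` only if the operation signal value exceeds the background value."""
    value = operation_signal_value(frame)
    if value > background_value:
        wake()
        return True
    return False  # at or below the background level: the device stays asleep


if __name__ == "__main__":
    woke = handle_wake_operation([4, -6], 3.0, lambda: print("device woken"))
    print(woke)
```

When the background signal value is updated periodically, the same function can simply be called with the most recent updated value.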
Optionally, in some embodiments, referring to fig. 4, the voice wake-up apparatus of the present application may further include an updating module 205, where the updating module 205 may be specifically configured to periodically update the background signal value to obtain an updated background signal value.
Optionally, in some embodiments, the wake-up module 204 may be further configured to: when the voice wake-up operation triggered for the target device is detected, wake up the target device according to the updated background signal value.
The voice wake-up procedure of the present application is completed above.
In view of the above, in the voice wake-up apparatus provided by the present application, the acquisition module 201 acquires a sample signal within a preset time length, the counting module 202 counts the signal value corresponding to each sample frame in the sample signal, the calculating module 203 calculates a background signal value corresponding to the sample signal based on the counted signal values, and the wake-up module 204 wakes up the target device according to the background signal value when a voice wake-up operation triggered for the target device is detected.
It will be understood by those skilled in the art that all or part of the steps of the methods of the above embodiments may be performed by instructions or by associated hardware controlled by the instructions, which may be stored in a computer readable storage medium and loaded and executed by a processor.
An embodiment of the present invention further provides an electronic device 500, as shown in fig. 5, the electronic device 500 may integrate the voice wake-up apparatus, and may further include a Radio Frequency (RF) circuit 501, a memory 502 including one or more computer-readable storage media, an input unit 503, a display unit 504, a sensor 505, an audio circuit 506, a Wireless Fidelity (WiFi) module 507, a processor 508 including one or more processing cores, a power supply 509, and other components. Those skilled in the art will appreciate that the configuration of the electronic device 500 shown in FIG. 5 does not constitute a limitation of the electronic device 500, and may include more or fewer components than shown, or some components in combination, or a different arrangement of components. Wherein:
The RF circuit 501 may be used for receiving and transmitting signals during information transmission and reception or during a call; in particular, it receives downlink information from a base station and passes it to the one or more processors 508 for processing, and it transmits uplink data to the base station. In general, the RF circuit 501 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM) card, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like. In addition, the RF circuit 501 may also communicate with networks and other devices via wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to Global System for Mobile Communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), email, Short Message Service (SMS), and the like.
The memory 502 may be used to store software programs and modules, and the processor 508 executes various functional applications and information processing by running the software programs and modules stored in the memory 502. The memory 502 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required by at least one function (such as a sound playing function, a target data playing function, etc.), and the like; the data storage area may store data (such as audio data, a phonebook, etc.) created according to the use of the electronic device 500, and the like. Further, the memory 502 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, the memory 502 may also include a memory controller to provide the processor 508 and the input unit 503 with access to the memory 502.
The input unit 503 may be used to receive input numeric or character information and to generate keyboard, mouse, joystick, optical, or trackball signal inputs related to user settings and function control. In one particular embodiment, the input unit 503 may include a touch-sensitive surface as well as other input devices. The touch-sensitive surface, also referred to as a touch display screen or a touch pad, may collect touch operations by a user on or near it (such as operations by the user on or near the touch-sensitive surface using a finger, a stylus, or any other suitable object or accessory) and drive the corresponding connection device according to a predetermined program. Optionally, the touch-sensitive surface may comprise two parts: a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects the signal produced by the touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts it into touch point coordinates, sends the coordinates to the processor 508, and can receive and execute commands sent by the processor 508. In addition, the touch-sensitive surface may be implemented using resistive, capacitive, infrared, surface acoustic wave, and other technologies. The input unit 503 may include other input devices in addition to the touch-sensitive surface. In particular, the other input devices may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.
The display unit 504 may be used to display information input by or provided to the user as well as various graphical user interfaces of the electronic device 500, which may be made up of graphics, text, icons, video, and any combination thereof. The display unit 504 may include a display panel, and optionally, the display panel may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED) display, or the like. Further, the touch-sensitive surface may overlay the display panel, and when a touch operation is detected on or near the touch-sensitive surface, the touch operation is transmitted to the processor 508 to determine the type of touch event, and then the processor 508 provides a corresponding visual output on the display panel according to the type of touch event. Although in FIG. 5 the touch-sensitive surface and the display panel are two separate components to implement input and output functions, in some embodiments the touch-sensitive surface may be integrated with the display panel to implement input and output functions.
The electronic device 500 may also include at least one sensor 505, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor, which may adjust the brightness of the display panel according to the brightness of ambient light, and a proximity sensor, which may turn off the display panel and/or the backlight when the electronic device 500 is moved to the ear. As one type of motion sensor, a gravitational acceleration sensor may detect the magnitude of acceleration in each direction (generally three axes) and the magnitude and direction of gravity when the device is stationary; it may be used in applications that recognize the posture of the device (such as switching between landscape and portrait, related games, and magnetometer posture calibration) and in vibration-recognition functions (such as a pedometer and tap detection). The electronic device 500 may also be configured with other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which are not described herein again.
The audio circuit 506, a speaker, and a microphone may provide an audio interface between a user and the electronic device 500. On the one hand, the audio circuit 506 may transmit the electrical signal converted from received audio data to the speaker, which converts it into a sound signal for output; on the other hand, the microphone converts a collected sound signal into an electrical signal, which the audio circuit 506 receives and converts into audio data; the audio data is output to the processor 508 for processing and then sent to, for example, another electronic device via the RF circuit 501, or output to the memory 502 for further processing. The audio circuit 506 may also include an earphone jack to allow a peripheral headset to communicate with the electronic device 500.
WiFi belongs to short-distance wireless transmission technology, and the electronic device 500 can help the user send and receive e-mails, browse web pages, access streaming media, etc. through the WiFi module 507, which provides the user with wireless broadband internet access. Although fig. 5 shows the WiFi module 507, it is understood that it does not belong to the essential constitution of the electronic device 500, and may be omitted entirely as needed within the scope not changing the essence of the invention.
The processor 508 is the control center of the electronic device 500. It connects the various parts of the entire device using various interfaces and lines, and performs the functions of the electronic device 500 and processes data by running or executing software programs and/or modules stored in the memory 502 and calling data stored in the memory 502, thereby monitoring the device as a whole. Optionally, the processor 508 may include one or more processing cores; preferably, the processor 508 may integrate an application processor, which primarily handles the operating system, user interfaces, application programs, etc., and a modem processor, which primarily handles wireless communications. It will be appreciated that the modem processor described above may also not be integrated into the processor 508.
The electronic device 500 further includes a power supply 509 (e.g., a battery) for powering the various components, which may be logically coupled to the processor 508 via a power management system to manage charging, discharging, and power consumption management functions via the power management system. The power supply 509 may also include any component of one or more dc or ac power sources, recharging systems, power failure detection circuitry, power converters or inverters, power data indicators, and the like.
Although not shown, the electronic device 500 may further include a camera, a bluetooth module, and the like, which are not described in detail herein. Specifically, in this embodiment, the processor 508 in the electronic device 500 loads the executable file corresponding to the process of one or more application programs into the memory 502 according to the following instructions, and the processor 508 runs the application programs stored in the memory 502, so as to implement various functions:
collecting a sample signal within a preset time length; counting the signal value corresponding to each sample frame in the sample signal; calculating a background signal value corresponding to the sample signal based on the counted signal values; and, when a voice wake-up operation triggered for the target device is detected, waking up the target device according to the background signal value.
In the above embodiments, the descriptions of the embodiments have respective emphasis, and a part which is not described in detail in a certain embodiment may refer to the above detailed description of the voice wakeup method, and is not described herein again.
As can be seen from the above, the electronic device 500 according to the embodiment of the invention collects a sample signal within a preset time length, counts the signal value corresponding to each sample frame in the sample signal, calculates a background signal value corresponding to the sample signal based on the counted signal values, and, when a voice wake-up operation triggered for the target device is detected, wakes up the target device according to the background signal value.
It will be understood by those skilled in the art that all or part of the steps of the methods of the above embodiments may be performed by instructions or by associated hardware controlled by the instructions, which may be stored in a computer readable storage medium and loaded and executed by a processor.
To this end, an embodiment of the present application further provides a storage medium, on which a plurality of instructions are stored, where the instructions are suitable for being loaded by a processor to perform the steps in the above voice wake-up method.
The above operations can be implemented in the foregoing embodiments, and are not described in detail herein.
The storage medium may include: Read Only Memory (ROM), Random Access Memory (RAM), a magnetic disk, an optical disk, and the like.
Since the instructions stored in the storage medium can execute the steps in any voice wake-up method provided in the embodiments of the present invention, the beneficial effects that can be achieved by any voice wake-up method provided in the embodiments of the present invention can be achieved, which are detailed in the foregoing embodiments and will not be described herein again.
The voice wake-up method, apparatus, system and storage medium provided by the embodiments of the present invention are described in detail above, and a specific example is applied in the present disclosure to explain the principle and the implementation of the present invention, and the description of the above embodiments is only used to help understanding the method and the core idea of the present invention; meanwhile, for those skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed, and in summary, the content of the present specification should not be construed as limiting the present invention.

Claims (10)

1. A voice wake-up method, comprising:
collecting a sample signal within a preset time length;
counting signal values corresponding to each frame of sample frames in the sample signals;
calculating a background signal value corresponding to the sample signal based on the statistical signal value;
and when the voice awakening operation triggered aiming at the target equipment is detected, awakening the target equipment according to the background signal value.
2. The method of claim 1, wherein the calculating the background signal value corresponding to the sample signal based on the statistical signal value comprises:
acquiring a historical signal value in historical time;
and calculating a background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.
3. The method of claim 2, wherein calculating the background signal value corresponding to the sample signal based on the fluctuation between the historical signal value and the statistical signal value comprises:
determining an initial signal value among the statistical signal values;
calculating a difference between the initial signal value and the historical signal value;
adjusting the historical signal value according to the difference value to obtain an adjusted signal value;
and adjusting the adjusted signal value according to the fluctuation among other signal values except the initial signal value to obtain a background signal value corresponding to the sample signal.
4. The method of claim 3, wherein said adjusting the historical signal value according to the difference value to obtain an adjusted signal value comprises:
when the difference value is detected to be larger than a threshold value, calculating the sum of the historical signal value and a preset value to obtain an adjusted signal value;
and when the difference value is smaller than a threshold value, calculating the difference between the historical signal value and the preset value to obtain an adjusted signal value.
5. The method according to any one of claims 1 to 4, wherein the waking up the target device according to the background signal value when detecting a voice wake-up operation triggered for the target device comprises:
when a voice wake-up operation triggered for the target device is detected, acquiring an operation signal value corresponding to the voice wake-up operation;
detecting whether the operation signal value is greater than the background signal value;
and when the operation signal value is detected to be greater than the background signal value, waking up the target device.
6. The method of any of claims 1 to 4, further comprising:
periodically updating the background signal value to obtain an updated background signal value;
wherein the waking up the target device according to the background signal value when detecting a voice wake-up operation triggered for the target device comprises: when the voice wake-up operation triggered for the target device is detected, waking up the target device according to the updated background signal value.
7. A voice wake-up apparatus, comprising:
the acquisition module is used for acquiring a sample signal within a preset time length;
the statistical module is used for counting the signal value corresponding to each frame of sample frame in the sample signal;
the calculating module is used for calculating a background signal value corresponding to the sample signal based on the statistical signal value;
and the awakening module is used for awakening the target equipment according to the background signal value when voice awakening operation triggered by the target equipment is detected.
8. The apparatus of claim 7, wherein the calculating module comprises:
an acquisition unit configured to acquire a historical signal value in a historical time;
and the calculating unit is used for calculating a background signal value corresponding to the sample signal according to the fluctuation between the historical signal value and the statistical signal value.
9. An electronic device comprising a memory and a processor, the memory storing a computer program, wherein the processor when executing the computer program implements the steps of the voice wake-up method according to any of claims 1 to 6.
10. A readable storage medium, having stored thereon a computer program which, when being executed by a processor, carries out the steps of the voice wake-up method according to any one of claims 1 to 6.
CN202310273455.XA 2023-03-21 2023-03-21 Voice wakeup method and device, electronic equipment and readable storage medium Active CN115985323B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310273455.XA CN115985323B (en) 2023-03-21 2023-03-21 Voice wakeup method and device, electronic equipment and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310273455.XA CN115985323B (en) 2023-03-21 2023-03-21 Voice wakeup method and device, electronic equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN115985323A true CN115985323A (en) 2023-04-18
CN115985323B CN115985323B (en) 2023-06-16

Family

ID=85959970

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310273455.XA Active CN115985323B (en) 2023-03-21 2023-03-21 Voice wakeup method and device, electronic equipment and readable storage medium

Country Status (1)

Country Link
CN (1) CN115985323B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116456441A (en) * 2023-06-16 2023-07-18 荣耀终端有限公司 Sound processing device, sound processing method and electronic equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103943105A (en) * 2014-04-18 2014-07-23 安徽科大讯飞信息科技股份有限公司 Voice interaction method and system
US20140249812A1 (en) * 2013-03-04 2014-09-04 Conexant Systems, Inc. Robust speech boundary detection system and method
CN110211579A (en) * 2019-04-28 2019-09-06 北京蓦然认知科技有限公司 A kind of voice instruction recognition method, apparatus and system
US20190349663A1 (en) * 2018-05-09 2019-11-14 Tymphany Acoustic Technology (Huizhou) Co., Ltd. System interacting with smart audio device
CN110600048A (en) * 2019-08-23 2019-12-20 Oppo广东移动通信有限公司 Audio verification method and device, storage medium and electronic equipment
CN111261143A (en) * 2018-12-03 2020-06-09 杭州嘉楠耘智信息科技有限公司 Voice wake-up method and device and computer readable storage medium
WO2023029615A1 (en) * 2021-08-30 2023-03-09 华为技术有限公司 Wake-on-voice method and apparatus, device, storage medium, and program product

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140249812A1 (en) * 2013-03-04 2014-09-04 Conexant Systems, Inc. Robust speech boundary detection system and method
CN103943105A (en) * 2014-04-18 2014-07-23 安徽科大讯飞信息科技股份有限公司 Voice interaction method and system
US20190349663A1 (en) * 2018-05-09 2019-11-14 Tymphany Acoustic Technology (Huizhou) Co., Ltd. System interacting with smart audio device
CN111261143A (en) * 2018-12-03 2020-06-09 杭州嘉楠耘智信息科技有限公司 Voice wake-up method and device and computer readable storage medium
CN110211579A (en) * 2019-04-28 2019-09-06 北京蓦然认知科技有限公司 A kind of voice instruction recognition method, apparatus and system
CN110600048A (en) * 2019-08-23 2019-12-20 Oppo广东移动通信有限公司 Audio verification method and device, storage medium and electronic equipment
WO2023029615A1 (en) * 2021-08-30 2023-03-09 华为技术有限公司 Wake-on-voice method and apparatus, device, storage medium, and program product

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116456441A (en) * 2023-06-16 2023-07-18 荣耀终端有限公司 Sound processing device, sound processing method and electronic equipment
CN116456441B (en) * 2023-06-16 2023-10-31 荣耀终端有限公司 Sound processing device, sound processing method and electronic equipment

Also Published As

Publication number Publication date
CN115985323B (en) 2023-06-16

Similar Documents

Publication Publication Date Title
KR102354275B1 (en) Speech recognition method and apparatus, and storage medium
CN109509473B (en) Voice control method and terminal equipment
CN111324235A (en) Screen refreshing frequency adjusting method and electronic equipment
CN111954286B (en) Power adjusting method, device, storage medium and terminal
CN110022405B (en) Radio frequency parameter adjusting method, device and storage medium
CN115985323B (en) Voice wakeup method and device, electronic equipment and readable storage medium
CN110335629B (en) Pitch recognition method and device of audio file and storage medium
CN116486833B (en) Audio gain adjustment method and device, storage medium and electronic equipment
CN113393838A (en) Voice processing method and device, computer readable storage medium and computer equipment
CN110688339B (en) Data transmission method and device, mobile terminal and storage medium
CN107066327B (en) Timing task processing method, mobile terminal and computer readable storage medium
CN114582264A (en) Display brightness control method and device, electronic equipment and storage medium
CN115995231B (en) Voice wakeup method and device, electronic equipment and readable storage medium
CN108287755B (en) Application program control method and mobile terminal
CN108681467B (en) System awakening method and terminal
CN112468870A (en) Video playing method, device, equipment and storage medium
CN111966206A (en) Power-saving control method and device for fingerprint sensor and mobile terminal
CN111627384A (en) Display picture compensation method and device, electronic equipment and storage medium
CN109561481B (en) Data sending method, terminal and storage medium
CN110891304B (en) gPS power consumption control method for mobile terminal
CN112433694B (en) Light intensity adjusting method and device, storage medium and mobile terminal
CN111614839B (en) Antenna self-adaptive control method and device
CN113110775B (en) Information carousel method, related device, equipment and storage medium
CN111654578B (en) Mobile terminal sensor restoration method
CN110972221B (en) Communication signal processing method and device and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant