CN108922522B - Device control method, device, storage medium, and electronic apparatus - Google Patents

Device control method, device, storage medium, and electronic apparatus

Publication number
CN108922522B
Authority
CN
China
Prior art keywords
audio signal
frequency spectrum
equipment
voice
controlling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810804492.8A
Other languages
Chinese (zh)
Other versions
CN108922522A (en)
Inventor
张新
王慧君
王子
韩雪
李保水
文皓
毛跃辉
廖湖锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gree Electric Appliances Inc of Zhuhai
Original Assignee
Gree Electric Appliances Inc of Zhuhai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gree Electric Appliances Inc of Zhuhai
Priority to CN201810804492.8A
Publication of CN108922522A
Application granted
Publication of CN108922522B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1822 Parsing for meaning understanding
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L15/222 Barge in, i.e. overridable guidance for interrupting prompts
    • G10L2015/223 Execution procedure of a spoken command
    • G10L15/26 Speech to text systems

Abstract

The invention provides a device control method and apparatus, a storage medium, and an electronic apparatus. The device control method comprises: parsing the semantics of an acquired audio signal; analyzing the spectrum of the audio signal when the semantics include a preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device; and controlling operation of the device using the audio signal when the matching degree of the spectrum is greater than a spectrum threshold. This addresses the inaccuracy of voice-controlled devices in the related art and enables a device to be controlled accurately by voice.

Description

Device control method, device, storage medium, and electronic apparatus
Technical Field
The present invention relates to the field of communications, and in particular, to a method and an apparatus for controlling a device, a storage medium, and an electronic apparatus.
Background
Voice-controlled air conditioners are increasingly popular. Those on the market are usually woken by recording a segment of audio and recognizing a wake-up word in it. Because people also say the wake-up word in ordinary conversation, it is recognized by mistake, which inconveniences users.
In view of the above technical problems, no effective solution has been proposed in the related art.
Disclosure of Invention
Embodiments of the invention provide a device control method and apparatus, a storage medium, and an electronic apparatus, to at least solve the problem of inaccurate voice control of devices in the related art.
According to an embodiment of the present invention, there is provided a device control method, including: parsing the semantics of an acquired audio signal; analyzing the spectrum of the audio signal when the semantics include a preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device; and controlling operation of the device using the audio signal when the matching degree of the spectrum is greater than a spectrum threshold.
Optionally, parsing the semantics of the acquired audio signal includes: collecting a voice signal input by a user; removing the non-human-voice components of the voice signal to obtain the audio signal; and parsing the semantics of the audio signal.
Optionally, when the semantics include the preset wake-up word, analyzing the spectrum of the audio signal includes: extracting the pitch and/or rhythm of the audio signal; and performing spectral analysis on the pitch and/or rhythm.
Optionally, when the matching degree of the spectrum is greater than the spectrum threshold, controlling operation of the device using the audio signal includes: speech-encoding the audio signal; and sending the voice code to a voice control center of the device, instructing the voice control center to respond to the voice code so as to control operation of the device.
According to another embodiment of the present invention, there is also provided a device control method, including: analyzing the spectrum of an acquired audio signal; parsing the semantics of the audio signal when the matching degree of the spectrum is greater than a spectrum threshold; and, when the semantics include a preset wake-up word, controlling operation of the device using the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device.
According to another embodiment of the present invention, there is also provided a device control apparatus, including: a first parsing module, configured to parse the semantics of an acquired audio signal; a first analysis module, configured to analyze the spectrum of the audio signal when the semantics include a preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device; and a first control module, configured to control operation of the device using the audio signal when the matching degree of the spectrum is greater than a spectrum threshold.
Optionally, the first parsing module includes: a first acquisition unit, configured to collect a voice signal input by a user; a first determining unit, configured to remove the non-human-voice components of the voice signal to obtain the audio signal; and a first parsing unit, configured to parse the semantics of the audio signal.
According to another embodiment of the present invention, there is also provided a device control apparatus, including: a second analysis module, configured to analyze the spectrum of an acquired audio signal; a second parsing module, configured to parse the semantics of the audio signal when the matching degree of the spectrum is greater than a spectrum threshold; and a second control module, configured to control operation of the device using a preset wake-up word when the semantics include the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device.
According to a further embodiment of the present invention, there is also provided a storage medium having a computer program stored therein, wherein the computer program is arranged to perform the steps of any of the above method embodiments when executed.
According to yet another embodiment of the present invention, there is also provided an electronic device, including a memory in which a computer program is stored and a processor configured to execute the computer program to perform the steps in any of the above method embodiments.
According to the invention, after voice is received, the audio signal in the voice is parsed semantically, and the spectrum of the audio signal is analyzed when the semantics include the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device; operation of the device is then controlled using the audio signal when the matching degree of the spectrum is greater than the spectrum threshold. That is, the wake-up word and the spectrum of the audio signal are combined to control the device. This addresses the inaccuracy of voice-controlled devices in the related art and enables a device to be controlled accurately by voice.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
FIG. 1 is a block diagram of the hardware structure of a mobile terminal for a device control method according to an embodiment of the present invention;
FIG. 2 is flowchart (one) of a device control method according to an embodiment of the present invention;
FIG. 3 is flowchart (two) of a device control method according to an embodiment of the present invention;
FIG. 4 is block diagram (one) of the structure of a device control apparatus according to an embodiment of the present invention;
FIG. 5 is block diagram (two) of the structure of a device control apparatus according to an embodiment of the present invention;
FIG. 6 is flowchart (one) of the wake-up operation of the voice air conditioner in the present embodiment;
FIG. 7 is flowchart (two) of the wake-up operation of the voice air conditioner in the present embodiment.
Detailed Description
The invention will be described in detail hereinafter with reference to the accompanying drawings in conjunction with embodiments. It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order.
Example 1
The method provided by the first embodiment of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking an example of the present invention running on a mobile terminal, fig. 1 is a block diagram of a hardware structure of the mobile terminal of a device control method according to an embodiment of the present invention. As shown in fig. 1, the mobile terminal 10 may include one or more (only one shown in fig. 1) processors 102 (the processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, and optionally may also include a transmission device 106 for communication functions and an input-output device 108. It will be understood by those skilled in the art that the structure shown in fig. 1 is only an illustration, and does not limit the structure of the mobile terminal. For example, the mobile terminal 10 may also include more or fewer components than shown in FIG. 1, or have a different configuration than shown in FIG. 1.
The memory 104 may be used for storing computer programs, for example, software programs and modules of application software, such as computer programs corresponding to the control method of the device in the embodiment of the present invention, and the processor 102 executes various functional applications and data processing by running the computer programs stored in the memory 104, so as to implement the method described above. The memory 104 may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some instances, the memory 104 may further include memory located remotely from the processor 102, which may be connected to the mobile terminal 10 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The transmission device 106 is used for receiving or transmitting data via a network. Specific examples of the network described above may include a wireless network provided by a communication provider of the mobile terminal 10. In one example, the transmission device 106 includes a Network adapter (NIC), which can be connected to other Network devices through a base station so as to communicate with the internet. In one example, the transmission device 106 may be a Radio Frequency (RF) module, which is used for communicating with the internet in a wireless manner.
In the present embodiment, a device control method is provided. Fig. 2 is flowchart (one) of the device control method according to an embodiment of the present invention. As shown in fig. 2, the flow includes the following steps:
step S202, parsing the semantics of the acquired audio signal;
step S204, when the semantics include a preset wake-up word, analyzing the spectrum of the audio signal, wherein the preset wake-up word includes a command for controlling operation of the device;
step S206, when the matching degree of the spectrum is greater than the spectrum threshold, controlling operation of the device using the audio signal.
Through the above steps, after voice is received, the audio signal in the voice is parsed semantically, and the spectrum of the audio signal is analyzed when the semantics include the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device; operation of the device is then controlled using the audio signal when the matching degree of the spectrum is greater than the spectrum threshold. That is, the wake-up word and the spectrum of the audio signal are combined to control the device. This addresses the inaccuracy of voice-controlled devices in the related art and enables a device to be controlled accurately by voice.
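Steps S202 to S206 can be sketched as a short gate in Python. This is only an illustration under stated assumptions: the patent does not specify the speech recognizer, the spectrum matcher, or the threshold value, so `parse_semantics`, `spectrum_match`, the `WAKE_WORDS` tuple, and the default `threshold=0.8` are all hypothetical placeholders supplied by the caller.

```python
from typing import Callable, Optional, Sequence

# Hypothetical preset wake-up words; each one doubles as a device command.
WAKE_WORDS = ("turn on the air conditioner", "turn off the air conditioner")


def find_wake_word(text: str, wake_words: Sequence[str] = WAKE_WORDS) -> Optional[str]:
    """Return the first preset wake-up word contained in the parsed text, if any."""
    lowered = text.lower()
    for word in wake_words:
        if word in lowered:
            return word
    return None


def control_device(
    audio: Sequence[float],
    parse_semantics: Callable[[Sequence[float]], str],
    spectrum_match: Callable[[Sequence[float]], float],
    threshold: float = 0.8,  # assumed value, not taken from the patent
) -> Optional[str]:
    """Semantics-first flow: return the command to execute, or None."""
    text = parse_semantics(audio)            # S202: parse the semantics
    wake_word = find_wake_word(text)         # S204: is a wake-up word present?
    if wake_word is None:
        return None                          # spectrum analysis is skipped
    if spectrum_match(audio) > threshold:    # S206: matching degree vs. threshold
        return wake_word
    return None
```

Calling `control_device` with a recognizer that returns "Please turn on the air conditioner" yields the command only when the matcher also reports a degree above the threshold; a wake-up word spoken with a poorly matching spectrum is ignored.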
Optionally, the above steps may be executed by a device, for example a household appliance such as an air conditioner or a television, but are not limited thereto.
In this embodiment, the audio signal is spoken voice, which carries both a preset wake-up word for waking the device and an audio spectrum. The preset wake-up word may be "turn on the air conditioner", "turn off the air conditioner", "turn up the air conditioner temperature", and so on. The spectrum of the audio signal covers its pitch and/or rhythm: the pitch and/or rhythm are extracted from the audio signal and then analyzed spectrally. In other words, the degree of matching between the spectrum of the acquired audio signal and the spectrum stored in the device is determined.
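The patent does not say how the matching degree between the acquired spectrum and the stored spectrum is computed. One plausible sketch, offered purely as an assumption, is the cosine similarity between two magnitude spectra; the naive DFT below and the `tone` test-signal helper are illustrative and not part of the source.

```python
import cmath
import math
from typing import List, Sequence


def magnitude_spectrum(samples: Sequence[float]) -> List[float]:
    """Naive O(n^2) DFT magnitudes for bins 0..n//2 (enough for a real signal)."""
    n = len(samples)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        mags.append(abs(acc))
    return mags


def matching_degree(stored: Sequence[float], observed: Sequence[float]) -> float:
    """Cosine similarity of two equal-length magnitude spectra, in [0, 1]."""
    if len(stored) != len(observed):
        raise ValueError("spectra must have the same number of bins")
    dot = sum(x * y for x, y in zip(stored, observed))
    norm = math.sqrt(sum(x * x for x in stored)) * math.sqrt(sum(y * y for y in observed))
    return dot / norm if norm > 0 else 0.0


def tone(cycles: int, n: int = 64) -> List[float]:
    """Hypothetical test signal: a sine with `cycles` periods over n samples."""
    return [math.sin(2 * math.pi * cycles * t / n) for t in range(n)]
```

With this measure, a tone compared against a stored copy of itself scores close to 1, while a tone at a different frequency scores close to 0, so a threshold between the two separates matching from non-matching input. A real implementation would more likely use an FFT library and frame-wise features.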
In an optional embodiment, the semantics of the acquired audio signal are parsed as follows: collect a voice signal input by a user; remove the non-human-voice components of the voice signal to obtain the audio signal; parse the semantics of the audio signal. In this embodiment, the non-human-voice components include noise, and the semantics mainly consist of speech for controlling the device, such as "turn on the air conditioner", "turn off the air conditioner", "turn up the air conditioner temperature", and the like.
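How the non-human-voice components are removed is left open in the patent. As a crude stand-in, and only as an assumption, the sketch below drops low-energy frames with an RMS gate; the function name, `frame_len`, and `min_rms` are hypothetical, and a production system would use a proper voice activity detector or band filtering instead.

```python
import math
from typing import List, Sequence


def drop_quiet_frames(
    samples: Sequence[float],
    frame_len: int = 4,     # assumed frame size, chosen small for illustration
    min_rms: float = 0.1,   # assumed energy gate
) -> List[float]:
    """Keep only frames whose RMS energy reaches min_rms."""
    kept: List[float] = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / len(frame))
        if rms >= min_rms:
            kept.extend(frame)
    return kept
```

The output of such a step is the "audio signal" that is then handed to semantic parsing; an empty result means no voice was found in the captured input.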
In an optional embodiment, when the matching degree of the spectrum is greater than the spectrum threshold, operation of the device is controlled using the audio signal as follows: speech-encode the audio signal; send the voice code to the voice control center of the device, instructing the voice control center to respond to the voice code so as to control operation of the device. In this embodiment, the voice control center in the device controls each function of the device: it matches the received voice code to the corresponding function and controls the operation of that function.
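The encode-and-dispatch step can be pictured as a lookup from a recognized command to a code, followed by a handler table in the voice control center. The code values, the `COMMAND_CODES` table, and the `VoiceControlCenter` class below are hypothetical; the patent does not define the encoding format or the center's interface.

```python
from typing import Callable, Dict

# Hypothetical mapping from recognized commands to voice codes.
COMMAND_CODES: Dict[str, int] = {
    "turn on the air conditioner": 0x01,
    "turn off the air conditioner": 0x02,
}


def encode_command(command: str) -> int:
    """Translate a recognized command into its voice code."""
    return COMMAND_CODES[command]


class VoiceControlCenter:
    """Matches each received voice code to a device function and runs it."""

    def __init__(self) -> None:
        self._handlers: Dict[int, Callable[[], str]] = {}

    def register(self, code: int, handler: Callable[[], str]) -> None:
        self._handlers[code] = handler

    def respond(self, code: int) -> str:
        handler = self._handlers.get(code)
        if handler is None:
            raise KeyError(f"no function registered for voice code {code:#04x}")
        return handler()
```

Keeping the code table separate from the handler table mirrors the split in the text: the recognition side only produces a code, and the control center alone decides which function that code triggers.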
In the present embodiment, a further device control method is provided. Fig. 3 is flowchart (two) of the device control method according to an embodiment of the present invention. As shown in fig. 3, the flow includes the following steps:
step S302, analyzing the spectrum of the acquired audio signal;
step S304, when the matching degree of the spectrum is greater than the spectrum threshold, parsing the semantics of the audio signal;
step S306, when the semantics include a preset wake-up word, controlling operation of the device using the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device.
Through the above steps, after voice is received, the spectrum of the audio signal is analyzed; when the matching degree of the spectrum is greater than the spectrum threshold, the audio signal in the voice is parsed semantically; and when the semantics include the preset wake-up word, operation of the device is controlled using the preset wake-up word. That is, the wake-up word and the spectrum of the audio signal are combined to control the device. This addresses the inaccuracy of voice-controlled devices in the related art and enables a device to be controlled accurately by voice.
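The spectrum-first ordering of steps S302 to S306 accepts the same inputs as the semantics-first ordering but skips semantic parsing whenever the spectrum fails to match. A minimal sketch follows, with the caller-supplied `spectrum_match` and `parse_semantics` functions and the default threshold all being assumptions rather than anything the patent specifies.

```python
from typing import Callable, Optional, Sequence


def control_device_spectrum_first(
    audio: Sequence[float],
    spectrum_match: Callable[[Sequence[float]], float],
    parse_semantics: Callable[[Sequence[float]], str],
    wake_words: Sequence[str],
    threshold: float = 0.8,  # assumed value, not taken from the patent
) -> Optional[str]:
    """Spectrum-first flow: return the wake-up word to act on, or None."""
    if spectrum_match(audio) <= threshold:   # S302: analyze and gate on the spectrum
        return None                          # semantic parsing never runs
    text = parse_semantics(audio).lower()    # S304: parse the semantics
    for word in wake_words:                  # S306: look for a preset wake-up word
        if word in text:
            return word
    return None
```

Which ordering is cheaper in practice depends on the relative cost of speech recognition and spectrum matching, which the patent does not discuss.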
Optionally, the above steps may be executed by a device, for example a household appliance such as an air conditioner or a television, but are not limited thereto.
In this embodiment, the spectrum is analyzed first and the semantics are parsed afterwards, so the spectrum and the semantics are again used in combination.
Fig. 4 is block diagram (one) of the structure of a device control apparatus according to an embodiment of the present invention. As shown in fig. 4, the apparatus includes a first parsing module 42, a first analysis module 44, and a first control module 46, which are described in detail below:
a first parsing module 42, configured to parse the semantics of the acquired audio signal;
a first analysis module 44, connected to the first parsing module 42 and configured to analyze the spectrum of the audio signal when the semantics include a preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device;
and a first control module 46, connected to the first analysis module 44 and configured to control operation of the device using the audio signal when the matching degree of the spectrum is greater than the spectrum threshold.
In an optional embodiment, the first parsing module 42 includes: a first acquisition unit, configured to collect a voice signal input by a user; a first determining unit, connected to the first acquisition unit and configured to remove the non-human-voice components of the voice signal to obtain the audio signal; and a first parsing unit, connected to the first determining unit and configured to parse the semantics of the audio signal.
In an optional embodiment, when the semantics include the preset wake-up word, the first analysis module 44 analyzes the spectrum of the audio signal as follows: extract the pitch and/or rhythm of the audio signal; perform spectral analysis on the pitch and/or rhythm.
In an optional embodiment, when the matching degree of the spectrum is greater than the spectrum threshold, the first control module 46 controls operation of the device using the audio signal as follows: speech-encode the audio signal; send the voice code to the voice control center of the device, instructing the voice control center to respond to the voice code so as to control operation of the device.
Fig. 5 is block diagram (two) of the structure of a device control apparatus according to an embodiment of the present invention. As shown in fig. 5, the apparatus includes a second analysis module 52, a second parsing module 54, and a second control module 56, which are described in detail below:
a second analysis module 52, configured to analyze the spectrum of the acquired audio signal; a second parsing module 54, connected to the second analysis module 52 and configured to parse the semantics of the audio signal when the matching degree of the spectrum is greater than the spectrum threshold; and a second control module 56, connected to the second parsing module 54 and configured to control operation of the device using a preset wake-up word when the semantics include the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device.
It should be noted that the above modules may be implemented in software or in hardware. In the latter case, this may be achieved in, but is not limited to, the following ways: the modules are all located in the same processor; or the modules are located in different processors in any combination.
The present invention will be described in detail with reference to the following specific examples:
the present invention is described below by taking an air conditioner as an example:
Voice-controlled air conditioners are increasingly popular. Those on the market are usually woken by recording a segment of audio and recognizing a wake-up word in it. Because people also say the wake-up word in ordinary conversation, it is recognized by mistake, which inconveniences users.
In fact, audio is a sound wave that unfolds over time, so whether two pieces of audio are similar or identical can be judged by analyzing their waveforms and comparing them. By analogy, KTV scoring software and song-recognition software score a singer or identify a song from its pitch and rhythm, that is, from the waveform of the audio.
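To make the idea of reading pitch from a waveform concrete, here is a minimal autocorrelation pitch estimator. It is an assumption chosen for illustration: the patent names no pitch-extraction method, and the function name, search range, and sample rate are hypothetical.

```python
import math
from typing import Sequence


def estimate_pitch_hz(
    samples: Sequence[float],
    sample_rate: int,
    min_hz: float = 80.0,    # assumed lower bound of the search range
    max_hz: float = 500.0,   # assumed upper bound of the search range
) -> float:
    """Return the frequency whose lag maximizes the autocorrelation, or 0.0."""
    min_lag = int(sample_rate / max_hz)
    max_lag = min(int(sample_rate / min_hz), len(samples) - 1)
    best_lag, best_score = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Correlate the signal with a copy of itself shifted by `lag` samples.
        score = sum(samples[i] * samples[i + lag] for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else 0.0


def sine(freq_hz: float, sample_rate: int, n: int) -> list:
    """Hypothetical test signal: a pure sine tone."""
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) for t in range(n)]
```

Running the estimator frame by frame over an utterance gives a pitch contour, and the spacing of the voiced frames gives a rough rhythm; either sequence could then be compared against a stored reference.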
A traditional voice air conditioner is woken by the wake-up word alone, so false wake-ups occur easily and the recognition rate is low. The traditional wake-up mode is also dull. An alternative is to extract the wake-up word and recognize its pitch and rhythm at the same time. On this basis, the present embodiment provides a mode in which the air conditioner is woken only when the keyword and its pitch and rhythm all reach predetermined values together, rather than when the keyword alone is matched.
The main idea of this embodiment is to define, for the voice air conditioner, a threshold and a calculation method for waking the air conditioner by combining the wake-up word with pitch and rhythm, so that a user can wake the air conditioner by singing, which reduces the false recognition rate and makes waking the device more engaging.
Fig. 6 is flowchart (one) of the wake-up operation of the voice air conditioner in this embodiment, which specifically includes the following steps:
S601: acquire the current audio signal input by the user.
S602: after the non-human voice is removed, parse the semantics of the audio signal. When semantic parsing succeeds, form a command word and compare it with the preset wake-up word.
S603: when the command word contains the preset wake-up word, apply a series of signal processing methods to perform spectrum matching analysis on the audio signal. If the matching degree of the audio signal is greater than the threshold, the voice wake-up command is considered successfully recognized.
S604: acquire the voice wake-up command code and send it to the air conditioner, wherein the air conditioner has a voice wake-up function.
This embodiment solves the technical problem that the wake-up mode of voice control in the prior art easily causes false wake-ups or a low recognition rate, and achieves the effect of waking the air conditioner simply, accurately, and in an engaging way.
Fig. 7 is flowchart (two) of the wake-up operation of the voice air conditioner in this embodiment, which specifically includes the following steps:
S701: voice collection;
S702: removing the non-human voice;
S703: judging whether voice exists; if so, go to S704, otherwise go to S701;
S704: semantic parsing;
S705: judging whether the word is a wake-up word; if so, go to S706, otherwise go to S703;
S706: processing the audio signal;
S707: judging whether the matching degree of the pitch and rhythm in the audio signal is greater than a threshold; if so, go to S708, otherwise go to S701;
S708: recognizing the voice wake-up command;
S709: acquiring the wake-up command code;
S710: the air conditioner executes the wake-up command code.
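The branching in steps S701 to S710 can be sketched as a loop over captured audio chunks. This is a simplification under stated assumptions: every "go back" branch, including the return from S705 to S703, is modeled as moving on to the next chunk, and all of the callables (`remove_non_voice`, `parse`, `match`, `encode`) are hypothetical stand-ins supplied by the caller.

```python
from typing import Any, Callable, Iterable, Optional, Sequence


def wake_loop(
    chunks: Iterable[Any],
    remove_non_voice: Callable[[Any], Any],
    parse: Callable[[Any], str],
    wake_words: Sequence[str],
    match: Callable[[Any], float],
    threshold: float,
    encode: Callable[[str], int],
) -> Optional[int]:
    """Return the wake-up command code for the first chunk passing every check."""
    for chunk in chunks:                      # S701: voice collection
        voiced = remove_non_voice(chunk)      # S702: remove the non-human voice
        if not voiced:                        # S703: is there any voice?
            continue
        text = parse(voiced).lower()          # S704: semantic parsing
        word = next((w for w in wake_words if w in text), None)
        if word is None:                      # S705: is it a wake-up word?
            continue
        if match(voiced) <= threshold:        # S706-S707: pitch and rhythm match
            continue
        return encode(word)                   # S708-S709: recognized, get the code
    return None                               # nothing to execute in S710
```

The caller would pass the returned code to the air conditioner for execution (S710); a `None` result means the loop ran out of input without a valid wake-up.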
In conclusion, because the wake-up mode combines the wake-up word with pitch and rhythm, the recognition rate and wake-up efficiency are improved compared with a traditional voice air conditioner, and waking the device becomes more engaging.
Alternatively, in the present embodiment, the storage medium may be configured to store a computer program for executing the above steps.
Optionally, in this embodiment, the storage medium may include, but is not limited to, various media capable of storing computer programs, such as a USB flash drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk.
Embodiments of the present invention also provide an electronic device comprising a memory having a computer program stored therein and a processor arranged to run the computer program to perform the steps of any of the above method embodiments.
Optionally, the electronic apparatus may further include a transmission device and an input/output device, wherein the transmission device is connected to the processor, and the input/output device is connected to the processor.
Optionally, the specific examples in this embodiment may refer to the examples described in the above embodiments and optional implementation manners, and this embodiment is not described herein again.
It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and alternatively, they may be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, and in some cases, the steps shown or described may be performed in an order different than that described herein, or they may be separately fabricated into individual integrated circuit modules, or multiple ones of them may be fabricated into a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the principle of the present invention should be included in the protection scope of the present invention.

Claims (8)

1. A method of controlling a device, comprising:
parsing the semantics of an acquired audio signal;
analyzing the spectrum of the audio signal when the semantics include a preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device;
controlling operation of the device using the audio signal when the matching degree of the spectrum is greater than a spectrum threshold;
wherein, when the semantics include the preset wake-up word, analyzing the spectrum of the audio signal comprises: extracting the pitch and/or rhythm of the audio signal; and performing spectral analysis on the pitch and/or rhythm;
and wherein, when the matching degree of the spectrum is greater than the spectrum threshold, controlling operation of the device using the audio signal comprises: speech-encoding the audio signal; and sending the voice code to a voice control center of the device, instructing the voice control center to respond to the voice code so as to control operation of the device.
2. The method of claim 1, wherein parsing semantics in the acquired audio signal comprises:
collecting voice signals input by a user;
removing the non-human voice signals in the voice signals to obtain the audio signals;
parsing semantics in the audio signal.
3. A method of controlling a device, comprising:
analyzing the spectrum of an acquired audio signal;
parsing the semantics of the audio signal when the matching degree of the spectrum is greater than a spectrum threshold;
when the semantics include a preset wake-up word, controlling operation of the device using the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device;
wherein, when the semantics include the preset wake-up word, analyzing the spectrum of the audio signal comprises: extracting the pitch and/or rhythm of the audio signal; and performing spectral analysis on the pitch and/or rhythm;
and wherein, when the matching degree of the spectrum is greater than the spectrum threshold, controlling operation of the device using the audio signal comprises: speech-encoding the audio signal; and sending the voice code to a voice control center of the device, instructing the voice control center to respond to the voice code so as to control operation of the device.
4. A control apparatus for a device, characterized by comprising:
a first parsing module, configured to parse the semantics of an acquired audio signal;
a first analysis module, configured to analyze the frequency spectrum of the audio signal in the case that the semantics include a preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device;
a first control module, configured to control operation of the device by using the audio signal in the case that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold;
wherein analyzing the frequency spectrum of the audio signal, in the case that the semantics include the preset wake-up word, comprises: parsing the pitch and/or tempo of the audio signal; and performing spectral analysis on the pitch and/or tempo;
and wherein controlling operation of the device by using the audio signal, in the case that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold, comprises: speech-encoding the audio signal; and sending the speech code to a voice control center of the device, and instructing the voice control device to respond to the speech code so as to control operation of the device.
5. The apparatus of claim 4, wherein the first parsing module comprises:
a first acquisition unit, configured to collect a voice signal input by a user;
a first determining unit, configured to remove non-human-voice signals from the voice signal to obtain the audio signal;
a first parsing unit, configured to parse the semantics of the audio signal.
6. A control apparatus for a device, characterized by comprising:
a second analysis module, configured to analyze the frequency spectrum of an acquired audio signal;
a second parsing module, configured to parse the semantics of the audio signal in the case that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold;
a second control module, configured to control operation of the device by using a preset wake-up word in the case that the semantics include the preset wake-up word, wherein the preset wake-up word includes a command for controlling operation of the device;
wherein analyzing the frequency spectrum of the audio signal, in the case that the semantics include the preset wake-up word, comprises: parsing the pitch and/or tempo of the audio signal; and performing spectral analysis on the pitch and/or tempo;
and wherein controlling operation of the device by using the audio signal, in the case that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold, comprises: speech-encoding the audio signal; and sending the speech code to a voice control center of the device, and instructing the voice control device to respond to the speech code so as to control operation of the device.
7. A storage medium having a computer program stored thereon, wherein the computer program is arranged to perform, when executed, the method of any one of claims 1 to 3.
8. An electronic device comprising a memory and a processor, wherein the memory has a computer program stored therein, and the processor is arranged to execute the computer program to perform the method of any one of claims 1 to 3.
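
The two method claims differ mainly in the order of the checks: claim 1 parses semantics first and only then analyzes the spectrum, while claim 3 analyzes the spectrum first and only then parses semantics. The following Python sketch illustrates that ordering only. Every name in it (`Gate`, `transcribe`, `spectrum_match`, `semantics_first`, `spectrum_first`) is hypothetical and does not come from the patent; the speech recognizer and the spectrum matcher are passed in as opaque callables, and the speech-encoding and dispatch to the voice control center are omitted.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence

Audio = Sequence[float]  # assumed representation: raw samples


@dataclass
class Gate:
    """Illustrative two-stage gate; all field names are assumptions."""
    transcribe: Callable[[Audio], str]        # semantic parsing (stub)
    spectrum_match: Callable[[Audio], float]  # matching degree (stub)
    wake_words: Sequence[str]                 # preset wake-up words (commands)
    threshold: float                          # frequency spectrum threshold

    def _find_wake_word(self, audio: Audio) -> Optional[str]:
        # Return the first preset wake-up word found in the parsed text.
        text = self.transcribe(audio)
        for word in self.wake_words:
            if word in text:
                return word
        return None

    def semantics_first(self, audio: Audio) -> Optional[str]:
        # Claim 1 ordering: the spectrum is analyzed only if a wake-up
        # word was found in the semantics.
        word = self._find_wake_word(audio)
        if word is None:
            return None
        if self.spectrum_match(audio) <= self.threshold:
            return None  # matching degree must be strictly greater
        return word

    def spectrum_first(self, audio: Audio) -> Optional[str]:
        # Claim 3 ordering: semantics are parsed only if the spectrum
        # matching degree exceeds the threshold.
        if self.spectrum_match(audio) <= self.threshold:
            return None
        return self._find_wake_word(audio)
```

In either ordering the second, possibly more expensive, stage runs only when the first stage passes, and a return value of `None` means the device is not controlled.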
CN201810804492.8A 2018-07-20 2018-07-20 Device control method, device, storage medium, and electronic apparatus Active CN108922522B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810804492.8A CN108922522B (en) 2018-07-20 2018-07-20 Device control method, device, storage medium, and electronic apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810804492.8A CN108922522B (en) 2018-07-20 2018-07-20 Device control method, device, storage medium, and electronic apparatus

Publications (2)

Publication Number Publication Date
CN108922522A CN108922522A (en) 2018-11-30
CN108922522B CN108922522B (en) 2020-08-11

Family

ID=64416806

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810804492.8A Active CN108922522B (en) 2018-07-20 2018-07-20 Device control method, device, storage medium, and electronic apparatus

Country Status (1)

Country Link
CN (1) CN108922522B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110706696A (en) * 2019-09-25 2020-01-17 珠海格力电器股份有限公司 Voice control method and device
CN110808030B (en) * 2019-11-22 2021-01-22 珠海格力电器股份有限公司 Voice awakening method, system, storage medium and electronic equipment
CN111240634A (en) * 2020-01-08 2020-06-05 百度在线网络技术(北京)有限公司 Sound box working mode adjusting method and device
CN113518202A (en) * 2021-04-07 2021-10-19 华北电力大学扬中智能电气研究中心 Security monitoring method and device, electronic equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104811777A (en) * 2014-01-23 2015-07-29 阿里巴巴集团控股有限公司 Smart television voice processing method, smart television voice processing system and smart television
CN107016993A (en) * 2017-05-15 2017-08-04 成都铅笔科技有限公司 Voice interaction system and method for smart home
CN108062354A (en) * 2017-11-22 2018-05-22 上海博泰悦臻电子设备制造有限公司 Information recommendation method, system, storage medium, electronic equipment and vehicle

Also Published As

Publication number Publication date
CN108922522A (en) 2018-11-30

Similar Documents

Publication Publication Date Title
CN108922522B (en) Device control method, device, storage medium, and electronic apparatus
CN107623614B (en) Method and device for pushing information
CN110134022B (en) Sound control method and device of intelligent household equipment and electronic device
CN107147618B (en) User registration method and device and electronic equipment
CN111223497B (en) Nearby wake-up method and device for terminal, computing equipment and storage medium
CN109584876B (en) Voice data processing method and device and voice air conditioner
US20170140750A1 (en) Method and device for speech recognition
CN108335695B (en) Voice control method, device, computer equipment and storage medium
CN109065051B (en) Voice recognition processing method and device
CN109493849A (en) Voice awakening method, device and electronic equipment
CN104575504A (en) Method for personalized television voice wake-up by voiceprint and voice identification
CN102404278A (en) Song request system based on voiceprint recognition and application method thereof
CN110751960B (en) Method and device for determining noise data
CN106992008A (en) Processing method and electronic equipment
CN108932947B (en) Voice control method and household appliance
CN111178081B (en) Semantic recognition method, server, electronic device and computer storage medium
CN110688518A (en) Rhythm point determining method, device, equipment and storage medium
EP4033483A2 (en) Method and apparatus for testing vehicle-mounted voice device, electronic device and storage medium
CN111145763A (en) GRU-based voice recognition method and system in audio
WO2014173325A1 (en) Gutturophony recognition method and device
CN109347708A (en) Audio recognition method, device, household appliance, cloud server and medium
CN115457938A (en) Method, device, storage medium and electronic device for identifying awakening words
CN112185425A (en) Audio signal processing method, device, equipment and storage medium
CN110992940B (en) Voice interaction method, device, equipment and computer-readable storage medium
EP3059731A1 (en) Method and apparatus for automatically sending multimedia file, mobile terminal, and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant