CN108922522B

CN108922522B - Device control method, device, storage medium, and electronic apparatus

Info

Publication number: CN108922522B
Application number: CN201810804492.8A
Authority: CN
Inventors: 张新; 王慧君; 王子; 韩雪; 李保水; 文皓; 毛跃辉; 廖湖锋
Original assignee: Gree Electric Appliances Inc of Zhuhai
Current assignee: Gree Electric Appliances Inc of Zhuhai
Priority date: 2018-07-20
Filing date: 2018-07-20
Publication date: 2020-08-11
Anticipated expiration: 2038-07-20
Also published as: CN108922522A

Abstract

The invention provides a control method and a control device of equipment, a storage medium and an electronic device, wherein the control method of the equipment comprises the following steps: analyzing the semantics of the obtained audio signal; analyzing the frequency spectrum of the audio signal under the condition that the semantic meaning comprises a preset awakening word, wherein the preset awakening word comprises a command for controlling the equipment to operate; and controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold value. Therefore, the problem that the voice control equipment in the related technology is inaccurate can be solved, and the effect of accurately controlling the equipment by using the voice is achieved.

Description

Device control method, device, storage medium, and electronic apparatus

Technical Field

The present invention relates to the field of communications, and in particular, to a method and an apparatus for controlling a device, a storage medium, and an electronic apparatus.

Background

The voice air conditioner is more and more popular, and the voice air conditioner is usually awakened by recording a segment of audio recognition awakening words in the market. Since people also talk about the awakening word in the communication process, the awakening word is mistakenly recognized, and troubles are caused to users.

In view of the above technical problems, no effective solution has been proposed in the related art.

Disclosure of Invention

The embodiment of the invention provides a device control method, a device, a storage medium and an electronic device, which are used for at least solving the problem of inaccurate voice control device in the related art.

According to an embodiment of the present invention, there is provided a control method of a device including: analyzing the semantics of the obtained audio signal; analyzing the frequency spectrum of the audio signal under the condition that the semantic meaning comprises a preset awakening word, wherein the preset awakening word comprises a command for controlling the equipment to operate; and controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value.

Optionally, parsing the semantics in the acquired audio signal comprises: collecting voice signals input by a user; removing the non-human voice signals in the voice signals to obtain the audio signals; parsing semantics in the audio signal.

Optionally, under the condition that the preset wake-up word is included in the semantic meaning, analyzing the frequency spectrum in the audio signal includes: resolving a pitch and/or tempo in the audio signal; performing a spectral analysis of the tones and/or tempos.

Optionally, in a case that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold, controlling the operation of the device using the audio signal includes: speech encoding the audio signal; and sending the voice code to a voice control center of the equipment, and instructing the voice control equipment to respond to the voice code so as to control the operation of the equipment.

According to another embodiment of the present invention, there is also provided a control method of an apparatus, including: analyzing the frequency spectrum of the acquired audio signal; analyzing semantics in the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value; and under the condition that the semantics comprise a preset awakening word, controlling the operation of the equipment by using the preset awakening word, wherein the preset awakening word comprises a command for controlling the operation of the equipment.

According to another embodiment of the present invention, there is also provided a control apparatus of a device, including: the first analysis module is used for analyzing the semantics in the acquired audio signal; the first analysis module is used for analyzing the frequency spectrum of the audio signal under the condition that the semantic meaning comprises a preset awakening word, wherein the preset awakening word comprises a command for controlling equipment to operate; and the first control module is used for controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value.

Optionally, the first parsing module includes: the first acquisition unit is used for acquiring a voice signal input by a user; the first determining unit is used for removing the non-human voice signals in the voice signals to obtain the audio signals; a first parsing unit for parsing semantics in the audio signal.

According to another embodiment of the present invention, there is also provided a control apparatus of a device, including: the second analysis module is used for analyzing the frequency spectrum of the acquired audio signal; the second analysis module is used for analyzing the semantics in the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value; and the second control module is used for controlling the operation of the equipment by using the preset awakening word under the condition that the semantic meaning comprises the preset awakening word, wherein the preset awakening word comprises a command for controlling the operation of the equipment.

According to a further embodiment of the present invention, there is also provided a storage medium having a computer program stored therein, wherein the computer program is arranged to perform the steps of any of the above method embodiments when executed.

According to yet another embodiment of the present invention, there is also provided an electronic device, including a memory in which a computer program is stored and a processor configured to execute the computer program to perform the steps in any of the above method embodiments.

According to the invention, after the voice is received, the semantic analysis is carried out on the audio signal in the voice, and the frequency spectrum of the audio signal is analyzed under the condition that the semantic includes the preset awakening word, wherein the preset awakening word includes a command for controlling the operation of equipment; and controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold value. I.e. the wake-up word and the spectrum in the audio signal are combined to control the device. Therefore, the problem that the voice control equipment in the related technology is inaccurate can be solved, and the effect of accurately controlling the equipment by using the voice is achieved.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:

fig. 1 is a block diagram of a hardware configuration of a mobile terminal of a device control method according to an embodiment of the present invention;

fig. 2 is a flowchart (one) of a control method of an apparatus according to an embodiment of the present invention;

fig. 3 is a flowchart (two) of a control method of an apparatus according to an embodiment of the present invention;

FIG. 4 is a block diagram (one) of the control apparatus of the device according to the embodiment of the present invention;

FIG. 5 is a block diagram of the structure of a control device of the apparatus according to an embodiment of the present invention;

FIG. 6 is a flowchart (one) of the wake-up operation of the voice air conditioner in the present embodiment;

fig. 7 is a flowchart of the wake-up operation of the voice air conditioner in this embodiment (two).

Detailed Description

The invention will be described in detail hereinafter with reference to the accompanying drawings in conjunction with embodiments. It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.

It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order.

Example 1

The method provided by the first embodiment of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Taking an example of the present invention running on a mobile terminal, fig. 1 is a block diagram of a hardware structure of the mobile terminal of a device control method according to an embodiment of the present invention. As shown in fig. 1, the mobile terminal 10 may include one or more (only one shown in fig. 1) processors 102 (the processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, and optionally may also include a transmission device 106 for communication functions and an input-output device 108. It will be understood by those skilled in the art that the structure shown in fig. 1 is only an illustration, and does not limit the structure of the mobile terminal. For example, the mobile terminal 10 may also include more or fewer components than shown in FIG. 1, or have a different configuration than shown in FIG. 1.

The memory 104 may be used for storing computer programs, for example, software programs and modules of application software, such as computer programs corresponding to the control method of the device in the embodiment of the present invention, and the processor 102 executes various functional applications and data processing by running the computer programs stored in the memory 104, so as to implement the method described above. The memory 104 may include high speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some instances, the memory 104 may further include memory located remotely from the processor 102, which may be connected to the mobile terminal 10 via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.

The transmission device 106 is used for receiving or transmitting data via a network. Specific examples of the network described above may include a wireless network provided by a communication provider of the mobile terminal 10. In one example, the transmission device 106 includes a Network adapter (NIC), which can be connected to other Network devices through a base station so as to communicate with the internet. In one example, the transmission device 106 may be a Radio Frequency (RF) module, which is used for communicating with the internet in a wireless manner.

In the present embodiment, a control method of a device is provided, and fig. 2 is a flowchart (a) of the control method of the device according to the embodiment of the present invention, as shown in fig. 2, the flowchart includes the following steps:

step S202, analyzing the semanteme in the obtained audio signal;

step S204, under the condition that the semantic meaning comprises a preset awakening word, analyzing the frequency spectrum of the audio signal, wherein the preset awakening word comprises a command for controlling the equipment to operate;

and step S206, controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold value.

Through the steps, after the voice is received, semantic analysis is carried out on the audio signal in the voice, and under the condition that the semantic includes the preset awakening word, the frequency spectrum of the audio signal is analyzed, wherein the preset awakening word includes a command for controlling the equipment to operate; and controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold value. I.e. the wake-up word and the spectrum in the audio signal are combined to control the device. Therefore, the problem that the voice control equipment in the related technology is inaccurate can be solved, and the effect of accurately controlling the equipment by using the voice is achieved.

Optionally, the executing subject of the above steps may be a device, such as an air conditioner, a household electrical appliance such as a television, and the like, but is not limited thereto.

In this embodiment, the audio signal is a spoken voice, and the voice includes a preset wake-up word of the wake-up device and an audio signal spectrum. Wherein, predetermine awakening word and can be "open the air conditioner", "close the air conditioner", "turn up the air conditioner degree" etc. and the frequency spectrum in the audio signal includes: the pitch and/or tempo are analyzed, in particular, the pitch and/or tempo in the audio signal is analyzed, and the frequency spectrum of the pitch and/or tempo is analyzed. I.e. to determine the degree of matching of the frequency spectrum in the acquired audio signal with the frequency spectrum stored in the device.

In an alternative embodiment, the semantics in the acquired audio signal are resolved by: collecting voice signals input by a user; removing non-human voice signals in the voice signals to obtain audio signals; parsing semantics in the audio signal. In this embodiment, the non-human voice signal includes some noisy signals, and the semantics mainly include voices for controlling the device, such as "turn on air conditioner", "turn off air conditioner", "turn up air conditioner degree", and the like.

In an alternative embodiment, the operation of the device is controlled by the audio signal in the case that the degree of matching of the frequency spectrum is greater than a frequency spectrum threshold by: performing speech coding on the audio signal; the speech coding is transmitted to a speech control center of the device instructing the speech control device to respond to the speech coding to control operation of the device. In this embodiment, a voice control center in the device controls each function of the device, matches the obtained voice code with the related function, and controls the operation of each function.

In the present embodiment, a control method of a device is provided, and fig. 3 is a flowchart (two) of the control method of the device according to the embodiment of the present invention, and as shown in fig. 3, the flowchart includes the following steps:

step S302, analyzing the frequency spectrum of the obtained audio signal;

step S304, under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value, analyzing the semantics in the audio signal;

step S306, under the condition that the semantic meaning includes the preset awakening word, the preset awakening word is used for controlling the operation of the equipment, wherein the preset awakening word includes a command for controlling the operation of the equipment.

Through the steps, after the voice is received, the frequency spectrum of the audio signal is analyzed, under the condition that the matching degree of the frequency spectrum is larger than the frequency spectrum threshold value, the semantic analysis is carried out on the audio signal in the voice, and under the condition that the semantic includes the preset awakening word, the operation of the equipment is controlled by using the preset awakening word. I.e. the wake-up word and the spectrum in the audio signal are combined to control the device. Therefore, the problem that the voice control equipment in the related technology is inaccurate can be solved, and the effect of accurately controlling the equipment by using the voice is achieved.

In this embodiment, the spectrum is analyzed first, and then the semantic meaning is analyzed, and the audio and semantic meaning are also combined.

Fig. 4 is a block diagram (one) of the structure of a control device of an apparatus according to an embodiment of the present invention, as shown in fig. 4, the device includes: a first analysis module 42, a first analysis module 44 and a first control module 46, which are described in detail below:

a first parsing module 42, configured to parse semantics in the acquired audio signal;

a first analyzing module 44, connected to the first analyzing module 42, for analyzing the frequency spectrum of the audio signal if the semantic includes a preset wake-up word, where the preset wake-up word includes a command for controlling the operation of the device;

and a first control module 46, connected to the first analysis module 44, for controlling the operation of the device by using the audio signal if the matching degree of the frequency spectrum is greater than the frequency spectrum threshold.

In an alternative embodiment, the first parsing module 42 includes: the first acquisition unit is used for acquiring a voice signal input by a user; the first determining unit is connected to the first acquisition unit and used for removing the non-human voice signals in the voice signals to obtain audio signals; and the first analysis unit is connected to the first determination unit and used for analyzing the semantics in the audio signal.

In an alternative embodiment, the first analysis module 44 analyzes the spectrum in the audio signal in case the preset wake up word is included in the semantics by: resolving a pitch and/or tempo in the audio signal; the pitch and/or tempo is spectrally analyzed.

In an alternative embodiment, the first control module 46 controls the operation of the device with the audio signal in the case that the matching degree of the frequency spectrum is greater than the frequency spectrum threshold by: performing speech coding on the audio signal; the speech coding is transmitted to a speech control center of the device instructing the speech control device to respond to the speech coding to control operation of the device.

Fig. 5 is a block diagram (ii) of the structure of a control apparatus of a device according to an embodiment of the present invention, as shown in fig. 5, the apparatus including: a second analysis module 52, a second analysis module 54, and a second control module 56, which are described in detail below:

a second analysis module 52 for analyzing the frequency spectrum of the acquired audio signal; a second parsing module 54, connected to the second parsing module 52, for parsing the semantics in the audio signal if the matching degree of the spectrum is greater than the spectrum threshold; and a second control module 56, connected to the second parsing module 54, configured to control operation of the device by using a preset wake-up word when the semantic includes the preset wake-up word, where the preset wake-up word includes a command for controlling operation of the device.

It should be noted that, the above modules may be implemented by software or hardware, and for the latter, the following may be implemented, but not limited to: the modules are all positioned in the same processor; alternatively, the modules are respectively located in different processors in any combination.

The present invention will be described in detail with reference to the following specific examples:

the present invention is described below by taking an air conditioner as an example:

In fact, the audio is in the form of sound waves spread in the time dimension, and whether the audio is similar or the same can be judged by analyzing the wave fronts (ripples, waves) and comparing the wave fronts (ripples, waves). The analogy to the KTV scoring software and the song listening and identifying software is based on the pitch, rhythm and your score and judge which song is, which is identified by the pop (or wave, waveform) of the audio.

The traditional voice air conditioner is only awakened by awakening words and phrases, so that false awakening is easy to occur, and the recognition rate is low; the traditional air conditioner awakening mode is uninteresting, and if the traditional air conditioner awakening mode is used for simultaneously identifying the tone and the rhythm through awakening word interception. Based on this, the present embodiment provides a way that the keyword and its tone and rhythm reach the predetermined value at the same time (instead of just meeting the keyword), and the air conditioner can be woken up.

The embodiment mainly comprises the following steps: the threshold value and the calculation method for waking up the air conditioner by combining the wake-up word with the tone and rhythm are customized for the voice air conditioner, so that a user can wake up the air conditioner in a singing input mode, the error recognition rate is reduced, and the wake-up interestingness is improved.

Fig. 6 is a flowchart (one) of the wake-up work of the voice air conditioner in this embodiment, which specifically includes the following steps:

s601: acquiring a current audio signal input by a user;

s602: and after the non-human voice is removed, analyzing the semantics in the audio signal. And forming a command word after the semantic analysis is successful, and comparing the command word with a preset awakening word.

S603: when the command word contains the preset awakening word, a series of signal processing methods are adopted to carry out spectrum matching analysis on the audio signal. And if the audio signal matching degree is greater than the threshold value, the voice awakening command is considered to be successfully identified.

S604: and acquiring a voice awakening command code, and sending the awakening command code to the air conditioner, wherein the air conditioner has the function of awakening the voice.

The embodiment solves the technical problem that the awakening mode of the voice control mode in the prior art easily causes mistaken awakening or recognition rate, and achieves the effect of simply, accurately and interestingly awakening the air conditioner.

Fig. 7 is a flowchart (ii) of the wake-up work of the voice air conditioner in this embodiment, which specifically includes the following steps:

s701: voice collection;

s702: removing the non-human voice;

s703: judging whether voice exists or not, if so, turning to S704, otherwise, turning to S701;

s704: and (3) semantic parsing:

s705: judging whether the word is a wake-up word, if so, turning to S706, otherwise, turning to S703;

s706: processing the audio signal;

s707: judging whether the matching degree of the tone and the rhythm in the audio signal is greater than a threshold value, if so, turning to S708, otherwise, S701;

s708: recognizing a voice wake-up command;

s709: acquiring a wake-up command code;

s710: the air conditioner executes the wake-up command code.

In conclusion, due to the adoption of the awakening mode of combining the awakening words with the tone and the rhythm, compared with the traditional voice air conditioner, the identification rate and the awakening efficiency are improved, and the interestingness is increased.

Alternatively, in the present embodiment, the storage medium may be configured to store a computer program for executing the above steps.

Optionally, in this embodiment, the storage medium may include, but is not limited to: various media capable of storing computer programs, such as a usb disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk.

Embodiments of the present invention also provide an electronic device comprising a memory having a computer program stored therein and a processor arranged to run the computer program to perform the steps of any of the above method embodiments.

Optionally, the electronic apparatus may further include a transmission device and an input/output device, wherein the transmission device is connected to the processor, and the input/output device is connected to the processor.

Optionally, the specific examples in this embodiment may refer to the examples described in the above embodiments and optional implementation manners, and this embodiment is not described herein again.

It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and alternatively, they may be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, and in some cases, the steps shown or described may be performed in an order different than that described herein, or they may be separately fabricated into individual integrated circuit modules, or multiple ones of them may be fabricated into a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.

The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A method of controlling a device, comprising:

analyzing the semantics of the obtained audio signal;

analyzing the frequency spectrum of the audio signal under the condition that the semantic meaning comprises a preset awakening word, wherein the preset awakening word comprises a command for controlling the equipment to operate;

controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value;

analyzing the frequency spectrum in the audio signal comprises, in the case that the preset wake-up word is included in the semantic meaning: resolving a pitch and/or tempo in the audio signal; performing spectral analysis on the tones and/or tempos;

in the case that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value, controlling the operation of the device by using the audio signal comprises: speech encoding the audio signal; and sending the voice code to a voice control center of the equipment, and instructing the voice control equipment to respond to the voice code so as to control the operation of the equipment.

2. The method of claim 1, wherein parsing semantics in the acquired audio signal comprises:

collecting voice signals input by a user;

removing the non-human voice signals in the voice signals to obtain the audio signals;

parsing semantics in the audio signal.

3. A method of controlling a device, comprising:

analyzing the frequency spectrum of the acquired audio signal;

analyzing semantics in the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value;

under the condition that the semantics comprise a preset awakening word, controlling the operation of the equipment by using the preset awakening word, wherein the preset awakening word comprises a command for controlling the operation of the equipment;

4. A control apparatus of a device, characterized by comprising:

the first analysis module is used for analyzing the semantics in the acquired audio signal;

the first analysis module is used for analyzing the frequency spectrum of the audio signal under the condition that the semantic meaning comprises a preset awakening word, wherein the preset awakening word comprises a command for controlling equipment to operate;

the first control module is used for controlling the operation of the equipment by using the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value;

5. The apparatus of claim 4, wherein the first parsing module comprises:

the first acquisition unit is used for acquiring a voice signal input by a user;

the first determining unit is used for removing the non-human voice signals in the voice signals to obtain the audio signals;

a first parsing unit for parsing semantics in the audio signal.

6. A control apparatus of a device, characterized by comprising:

the second analysis module is used for analyzing the frequency spectrum of the acquired audio signal;

the second analysis module is used for analyzing the semantics in the audio signal under the condition that the matching degree of the frequency spectrum is greater than a frequency spectrum threshold value;

the second control module is used for controlling the operation of the equipment by using a preset awakening word under the condition that the semantic meaning comprises the preset awakening word, wherein the preset awakening word comprises a command for controlling the operation of the equipment;

7. A storage medium having a computer program stored thereon, wherein the computer program is arranged to perform the method of claims 1 to 2 or 3 when executed.

8. An electronic device comprising a memory and a processor, wherein the memory has stored therein a computer program, and wherein the processor is arranged to execute the computer program to perform the method of claims 1 to 2, or claim 3.