CN115938355A - Voice recognition method and device, electronic equipment and storage medium - Google Patents

Voice recognition method and device, electronic equipment and storage medium Download PDF

Info

Publication number
CN115938355A
CN115938355A CN202111108287.6A CN202111108287A CN115938355A CN 115938355 A CN115938355 A CN 115938355A CN 202111108287 A CN202111108287 A CN 202111108287A CN 115938355 A CN115938355 A CN 115938355A
Authority
CN
China
Prior art keywords
control instruction
controllable
instruction
user
control
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111108287.6A
Other languages
Chinese (zh)
Inventor
田书君
许升
丁晓鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Haier Washing Machine Co Ltd
Haier Smart Home Co Ltd
Original Assignee
Qingdao Haier Washing Machine Co Ltd
Haier Smart Home Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Haier Washing Machine Co Ltd, Haier Smart Home Co Ltd filed Critical Qingdao Haier Washing Machine Co Ltd
Priority to CN202111108287.6A priority Critical patent/CN115938355A/en
Publication of CN115938355A publication Critical patent/CN115938355A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/02Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]

Landscapes

  • Selective Calling Equipment (AREA)

Abstract

The application provides a voice recognition method, a voice recognition device, electronic equipment and a storage medium. The method comprises the steps of selecting a first controllable device closest to a user from a plurality of controllable devices as an auxiliary monitoring device according to a device awakening instruction, determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction, receiving a first control instruction acquired by the first controllable device and sent by the user, receiving a second control instruction acquired by the second controllable device and sent by the user, and controlling the second controllable device to execute the second control instruction if the first control instruction is the same as the second control instruction. By using the method provided by the application, not only can a plurality of controllable devices be prevented from being awakened simultaneously, but also the controllable devices can be assisted to acquire complete instruction information.

Description

Voice recognition method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of speech recognition technologies, and in particular, to a speech recognition method and apparatus, an electronic device, and a storage medium.
Background
The appearance of voice home appliances brings convenience to life for people. With the wide use of voice home appliances, how to accurately wake up a specific voice home appliance and enhance the capability of the voice home appliance to acquire information has gradually become a problem concerned in the life of people.
In the prior art, in order to wake up a specific voice home device accurately, a voice home device to be woken up is generally determined by determining a distance between the voice home device and a user.
However, the device to be awakened is determined according to the distance, so that the accuracy is not certain, and the information acquisition capability of the voice household appliance cannot be enhanced.
Disclosure of Invention
The application provides a voice recognition method which is used for solving the problems that a plurality of voice household appliances are waken up by mistake and the capability of acquiring information of the voice household appliances is poor.
In one aspect, the present application provides a speech recognition method, including:
acquiring a device awakening instruction, selecting a first controllable device closest to a user from the plurality of controllable devices as an auxiliary monitoring device according to the device awakening instruction, and determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction;
receiving a first control instruction sent by a user and collected by first controllable equipment, and receiving a second control instruction sent by the user and collected by second controllable equipment;
and if the first control instruction is the same as the second control instruction, controlling the second controllable device to execute the second control instruction.
In another aspect, the present application provides a speech recognition apparatus comprising:
the device comprises a determining module, a monitoring module and a control module, wherein the determining module is used for acquiring a device awakening instruction, selecting a first controllable device closest to a user from the plurality of controllable devices as an auxiliary monitoring device according to the device awakening instruction, and determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction;
and the receiving control module is used for receiving a first control instruction which is acquired by first controllable equipment and sent by a user, and receiving a second control instruction which is acquired by second controllable equipment and sent by the user, wherein the first control instruction is the same as the second control instruction, and the second controllable equipment is controlled to execute the second control instruction.
In yet another aspect, the present application provides an electronic device comprising:
at least one processor; and
a memory;
the memory stores computer execution instructions;
the at least one processor executing the memory-stored computer-executable instructions causes the at least one processor to perform implementing the method of any one of the preceding claims.
In a final aspect, the application provides a computer program product comprising a computer program which, when executed by a processor, performs the method as in any of the previous claims.
The voice recognition method, the voice recognition device, the electronic device and the storage medium provided by the application select the first controllable device closest to a user from the plurality of controllable devices as the auxiliary monitoring device by acquiring the device awakening instruction, and determine the second controllable device from the plurality of controllable devices as the device to be controlled, receive the first control instruction sent by the user collected by the first controllable device and receive the second control instruction sent by the user collected by the second controllable device according to the device awakening instruction, and if the first control instruction is the same as the second control instruction, control the second controllable device to execute the second control instruction.
By the method, the nearest controllable device and the device to be controlled are determined through the distance, then the nearest controllable device is used for assisting the device to be controlled to enhance the instruction of information acquisition of the device to be controlled, and the corresponding instruction is executed. Compared with the method for determining the equipment to be controlled only through the distance in the prior art, the method and the device for determining the equipment to be controlled can avoid simultaneously awakening a plurality of controllable equipment and assist the controllable equipment to acquire complete instruction information.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application.
FIG. 1 is a schematic diagram of a network architecture on which the present application is based;
FIG. 2 is a flow chart of a speech recognition method provided herein;
FIG. 3 is a schematic diagram of a speech recognition apparatus provided in the present application;
fig. 4 is a schematic diagram of a hardware structure of an electronic device provided in the present application.
Specific embodiments of the present application have been shown by way of example in the drawings and will be described in more detail below. The drawings and written description are not intended to limit the scope of the inventive concepts in any manner, but rather to illustrate the concepts of the application by those skilled in the art with reference to specific embodiments.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present application, as detailed in the appended claims.
With the development of internet of things and the popularization of intelligent home appliances, voice home appliances have been widely popularized and used. How to avoid the simultaneous awakening of a plurality of voice household appliances and enhance the capability of voice recognition household appliances for collecting information becomes the focus of attention in the field.
In the prior art, in order to avoid the simultaneous wake-up of multiple voice home appliances, the voice home appliances are mainly wakened up by a distance policy, that is, for the same type of voice home appliances, the nearest voice home appliance is selected as a wake-up appliance according to the distance between a user and each voice home appliance.
Obviously, in the prior art, the awakened voice household appliance is determined by means of judging the distance between the voice household appliance of the same category and the user, which is not accurate enough and has limitations, and meanwhile, the processing method cannot enhance the information acquisition capability of the voice household appliance.
For the technical problem, the inventor considers that the distance between the voice household appliance and the user can be utilized to determine the nearest device, and then finds the device to be executed, and then utilizes the nearest device as an auxiliary monitoring device of the device to be executed, so as to enhance the capability of the device to be executed for acquiring voice information and more accurately execute the instruction issued by the user.
Specifically, the application provides a speech recognition method. The method comprises the steps of selecting a first controllable device closest to a user from a plurality of controllable devices as an auxiliary monitoring device according to a device awakening instruction, determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction, receiving a first control instruction acquired by the first controllable device and sent by the user, receiving a second control instruction acquired by the second controllable device and sent by the user, and controlling the second controllable device to execute the second control instruction if the first control instruction is the same as the second control instruction. By using the method provided by the application, not only can a plurality of controllable devices be prevented from being awakened simultaneously, but also the controllable devices can be enhanced to acquire complete voice instruction information.
The following describes in detail the technical solutions of the embodiments of the present application and how to solve the above technical problems with specific embodiments. The following several specific embodiments may be combined with each other, and details of the same or similar concepts or processes may not be repeated in some embodiments. Embodiments of the present application will be described below with reference to the accompanying drawings.
Referring to fig. 1, fig. 1 is a schematic diagram of a network architecture on which the present application is based, and as shown in fig. 1, the network architecture includes a server 1 and a voice home device 2.
The server 1 is specifically a server cluster capable of processing mass data, and a voice recognition device provided by the present application may be integrated or installed therein, where the voice recognition device may determine a device to be executed based on a voice recognition method provided by the present application and enhance the capability of the device to acquire information.
The voice household appliance 2 can be specifically a household appliance capable of performing intelligent voice interaction with a user, such as an intelligent voice sound box, an intelligent voice washing machine, an intelligent voice refrigerator and the like, based on network communication, the voice household appliance 2 can upload collected voice information to the server 1 for use by a voice recognition device in the server 1, judge the auxiliary monitoring device and the equipment to be executed from the voice household appliance 2, and control the auxiliary monitoring device to enhance the integrity of the equipment to be executed for collecting the voice information sent by the user, so that the equipment to be executed can execute a user instruction more accurately.
Example one
Fig. 2 is a schematic flowchart of a speech recognition method provided in the present application, where as shown in fig. 2, the method includes:
step 201, obtaining a device wake-up instruction, selecting a first controllable device closest to a user from the plurality of controllable devices according to the device wake-up instruction to serve as an auxiliary monitoring device, and determining a second controllable device from the plurality of controllable devices to serve as a device to be controlled according to the device wake-up instruction;
step 202, receiving a first control instruction sent by a user and acquired by a first controllable device, receiving a second control instruction sent by the user and acquired by a second controllable device, and if the first control instruction is the same as the second control instruction, controlling the second controllable device to execute the second control instruction.
Specifically, the main execution body of the speech recognition method provided by the present application is the aforementioned speech recognition device, and as mentioned above, the speech recognition device may be specifically installed or carried in the aforementioned server 1.
The intelligent home appliance system according to the present embodiment is configured to set different types of voice home appliances, i.e., controllable appliances, in the same group network, and the server 1 uniformly controls the voice home appliances.
In an actual scene, after a user sends a wake-up instruction and is acquired by a voice household appliance, the voice household appliance uploads the acquired voice instructions to the server 1, the server 1 judges that a first controllable device closest to the user is used as an auxiliary monitoring device according to the acquired voice instructions, and then determines that a second controllable device is used as a device to be controlled through the voice instructions.
Subsequently, the auxiliary monitoring device and the device to be controlled respectively upload the obtained user voice information instruction to the server 1, so that the server determines whether the device to be controlled obtains a complete user instruction according to the semantics of the two voice information, and if so, the device to be controlled executes the instruction.
Compared with the prior art, the mode of awakening and collecting the voice information of the user can avoid awakening a plurality of voice household appliances at the same time, improve the accuracy of awakening the equipment to be executed and enhance the capability of the voice household appliances for collecting the voice information of the user.
In step 201, a wake-up command, such as turning on the washing machine, playing music, starting the sweeping robot, etc., is pre-stored in the server for the voice recognition device to use. And the voice recognition device determines the auxiliary monitoring equipment and the equipment to be executed according to the voice information instruction uploaded to the server by the controllable equipment.
Specifically, after the user sends the wake-up instruction, all controllable devices in the intelligent household appliance system can acquire the wake-up instruction sent by the user, enter a standby state, and simultaneously upload the voice instruction information of the user respectively acquired to the server.
And then, the voice recognition device in the server determines the position distribution of each controllable device in the current environment according to the signal-to-noise ratio of the awakening voice instruction information acquired by each controllable device, and determines the voice household appliance closest to the user according to the position distribution. The signal-to-noise ratio refers to a ratio of the intensity of a voice signal acquired by the voice household appliance to peripheral noise, namely, the larger the signal-to-noise ratio is, the closer the voice household appliance is to a user.
Meanwhile, the voice recognition device determines the equipment to be executed according to the association relationship between the semantics and the voice household appliance preset in the server and the semantics in the awakening instruction information collected by the voice household appliance. The semantic and voice home appliance association relationship means that a certain word in the voice information can be locked to a specific voice home appliance.
After determining that the auxiliary monitoring device and the device to be controlled are present, the voice recognition apparatus further performs step 202, receives the control instructions acquired by the two devices, and then determines whether the control instructions acquired by the two devices are consistent.
Specifically, the voice recognition device receives a first control instruction sent by a user and acquired by the auxiliary monitoring equipment, and a second control instruction sent by the user and acquired by the equipment to be controlled. Then the voice recognition device analyzes whether the first control instruction and the second control instruction are the same, namely, whether the word number of the two control instructions is the same is judged firstly, and if the word number is the same, the first control instruction and the second control instruction are the same.
Further, if the first control instruction is the same as the second control instruction, it is further required to perform word-by-word comparison on the information of the first control instruction and the information of the second control instruction to determine whether the semantics are complete, and if the semantics are complete, the device to be executed executes the second control instruction.
Wherein, the word number may be the sum of the word number of the character and the word number of the symbol in each control instruction; the semantic meaning refers to the specific content of a control instruction sent by a user, such as washing sweater with water at 40 ℃.
Optionally, if the auxiliary monitoring device and the device to be controlled acquire that the first control instruction and the second control instruction sent by the user are different, the voice recognition device determines whether the semantics of the second control instruction are complete, that is, the number of words of the control instruction is different, and needs to determine whether the semantics of the control instruction are complete.
And if the second control instruction information is consistent with the first control instruction information in semantic meaning, namely the second control instruction information is complete in semantic meaning, executing the second control instruction by the equipment to be controlled.
And if the second control instruction information is inconsistent with the first control instruction information in semantics, namely the semantics of the second control instruction is incomplete, complementing the parts with differences in the word-by-word comparison result by using the first control instruction information. Then, the voice recognition device sends a confirmation instruction to the device to be controlled, namely, the device to be controlled sends instruction confirmation information to the user, wherein the instruction confirmation information is second control instruction information which is completed by the first control instruction information, the confirmed information is fed back to the voice recognition device, and if the user determines the request, the device to be controlled executes the second control instruction.
For example, the first instruction information sent by the user and collected by the auxiliary monitoring device is "small X wash sweater with water of 40 ℃, the second instruction information sent by the user and collected by the device to be controlled is" wash sweater with water of 60 ℃, and the voice recognition device finds that the number of words of the control instructions is different, but the semantics of the second control instruction is incomplete by analyzing the control instructions.
The part 60 of the speech recognition device that has a discrepancy in the result of the word-by-word comparison is completed with 40, and the device to be controlled will send "ask you to perform the following operations: is sweater washed with water at 40? If the information sent by the user is 'yes', the equipment to be controlled feeds back the confirmation information to the voice recognition device, and the voice recognition device controls the equipment to be controlled to execute the instruction.
In other optional embodiments, the first control instruction is different from the second control instruction, that is, the word numbers of the first control instruction and the second control instruction are different, the semantics of the control instructions are continuously analyzed, and an instruction that the control instructions conflict with each other is obtained, and the voice recognition device issues a prompt to the device to be controlled to resend the control instruction to remind the user, so that the device to be controlled can acquire instruction information sent by the user again.
Wherein, the conflict instruction indicates that the time semantics have conflict, for example, the first control instruction is 'washing the sweater with water at 40 ℃ in small X', and the second control instruction is 'turning on the sound to play music'. Obviously, the voice household appliance to be controlled by the first control instruction is an intelligent washing machine, and the voice household appliance to be controlled by the second control instruction is an intelligent sound box, because the two control instructions have conflict, the fixed voice recognition device cannot distinguish the accuracy of the first control instruction and the accuracy of the second control instruction, and then the device to be controlled issues an instruction for requesting the user to send the control instruction again.
When the equipment to be controlled acquires the voice finger sent by the user again, the intelligent voice recognition device compares the second control instruction acquired again with the first control instruction again, and if the comparison results are still conflict instructions, the operation is executed again until the comparison results are the same; if the comparison result is the same, the equipment to be controlled executes according to the second control instruction acquired again; if the comparison result is different, judging whether the newly acquired second control instruction semantics are complete, if not, completing the second control instruction by using the first control instruction, and executing the control equipment according to the completed instruction.
The application provides a speech recognition method. The method comprises the steps of selecting a first controllable device closest to a user from a plurality of controllable devices as an auxiliary monitoring device according to a device awakening instruction, determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction, receiving a first control instruction acquired by the first controllable device and sent by the user, receiving a second control instruction acquired by the second controllable device and sent by the user, and controlling the second controllable device to execute the second control instruction if the first control instruction is the same as the second control instruction. By using the method provided by the application, not only can a plurality of controllable devices be prevented from being awakened simultaneously, but also the controllable devices can be assisted to acquire complete instruction information.
Example two:
corresponding to the heating stove maintenance management method of the present application, fig. 3 is a schematic structural diagram of a voice recognition device provided by the present application. For ease of illustration, only the portions relevant to the present application are shown.
Referring to fig. 3, the voice recognition apparatus includes:
a determining module 10, configured to obtain a device wake-up instruction, select, according to the device wake-up instruction, a first controllable device closest to a user from the multiple controllable devices to serve as an auxiliary monitoring device, and determine, according to the device wake-up instruction, a second controllable device from the multiple controllable devices to serve as a device to be controlled;
the receiving control module 20 is configured to receive a first control instruction sent by a user and acquired by a first controllable device, and receive a second control instruction sent by the user and acquired by a second controllable device; and if the first control instruction is the same as the second control instruction, controlling the second controllable device to execute the second control instruction.
The receiving control module 20 is specifically configured to:
judging whether the control semantics of the second control instruction are complete;
and if so, controlling the second controllable equipment to execute a second control instruction.
The receiving control module 20 is further specifically configured to:
analyzing the control semantics of the first control instruction, and completing the control semantics of the second control instruction by using the control semantics of the first control instruction;
sending an instruction confirmation request to a user by using second controllable equipment, wherein the instruction confirmation information comprises the control semantics of the second control instruction after completion;
and controlling the second controllable device to execute the supplemented second control instruction according to the received feedback information of the instruction confirmation request of the user.
The receiving control module 20 is further specifically configured to:
if the first control instruction and the second control instruction are conflict instructions, reminding a user to resend the control instruction by using the second controllable equipment;
and returning to the step of receiving the first control instruction sent by the user and collected by the first controllable device, and receiving the second control instruction sent by the user and collected by the second controllable device.
The determining module 10 is specifically applied to:
determining the signal-to-noise ratio of the equipment awakening instruction, and determining the position distribution of each controllable equipment in the current environment;
and according to the signal-to-noise ratio and the position distribution, selecting a first controllable device closest to a user from the plurality of controllable devices as an auxiliary monitoring device, and determining a second controllable device from the plurality of controllable devices as a device to be controlled.
The receiving control module 20 is further specifically configured to:
respectively determining the instruction word numbers corresponding to the first control instruction and the second control instruction;
and judging whether the first control instruction and the second control instruction are the same according to the instruction word number of the first control instruction and the instruction word number of the second control instruction.
The implementation principle of the speech recognition device provided in the present application is similar to that in any of the above embodiments, and is not described herein again.
The application provides a voice recognition device. The method comprises the steps of selecting a first controllable device closest to a user from a plurality of controllable devices as an auxiliary monitoring device according to a device awakening instruction, determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction, receiving a first control instruction acquired by the first controllable device and sent by the user, receiving a second control instruction acquired by the second controllable device and sent by the user, and controlling the second controllable device to execute the second control instruction if the first control instruction is the same as the second control instruction. By utilizing the method provided by the application, not only can a plurality of controllable devices be prevented from being awakened simultaneously, but also the controllable devices can be assisted to acquire complete instruction information
EXAMPLE III
Fig. 4 is a schematic diagram of a hardware structure of the electronic device provided in the present application, and only shows a part related to the present application for convenience of description.
Referring to fig. 4, a schematic structural diagram of an electronic device 1000 suitable for implementing an embodiment of the present application is shown, where the electronic device 1000 may be a terminal device. Among them, the terminal Device may include, but is not limited to, a mobile terminal such as a mobile phone, a notebook computer, a Digital broadcast receiver, a Personal Digital Assistant (PDA), a tablet computer (PAD), a Portable Multimedia Player (PMP), a car navigation Device (e.g., a car navigation terminal), and the like, and a fixed terminal such as a Digital TV, a desktop computer, and the like. The electronic device shown in fig. 4 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 4, the electronic device 1000 may include a control means (e.g., a central processing unit, a graphics processor, etc.) 1001 which may perform various suitable actions and processes according to a program stored in a Read Only Memory (ROM) 1002 or a program loaded from a storage means 1009 into a Random Access Memory (RAM) 1003. In the RAM 1003, various programs and data necessary for the operation of the electronic apparatus 1000 are also stored. The control device 1001, ROM 1002, and RAM 1003 are connected to each other by a bus 1004. An input/output (I/O) interface 1006 is also connected to bus 1004.
Generally, the following devices may be connected to the I/O interface 1006: input devices 1006 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; an output device 1007 including, for example, a Liquid Crystal Display (LCD), a speaker, a vibrator, and the like; storage devices 1009 including, for example, magnetic tape, hard disk, and the like; and a communication device 10010. The communication device 10010 may allow the electronic device 1000 to communicate wirelessly or by wire with other devices to exchange data. While fig. 4 illustrates an electronic device 1000 having various means, it is to be understood that not all illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
In particular, according to embodiments of the present application, the processes described above with reference to the flow diagrams may be implemented as computer software programs. For example, embodiments of the present application include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated by the flow chart. In such an embodiment, the computer program may be downloaded and installed from the network through the communication device 10010, or installed from the storage device 1009, or installed from the ROM 1002. When the computer program is executed by the control device 1001, the above-described functions defined in the method of the embodiment of the present application are performed.
It should be noted that the computer readable medium mentioned above in the present application may be a computer readable signal medium or a computer readable storage medium or any combination of the two. A computer readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
In the present application, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In this application, however, a computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: electrical wires, optical cables, RF (radio frequency), etc., or any suitable combination of the foregoing.
The computer readable medium may be embodied in the electronic device; or may be separate and not incorporated into the electronic device.
The computer readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to perform the methods shown in the above embodiments.
A computer program product is provided that may have computer program code written in one or more programming languages, including an object oriented programming language such as Java, smalltalk, C + +, or a combination thereof, as well as conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or media library. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of Network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units described in the embodiments of the present application may be implemented by software or hardware. Where the name of a unit does not in some cases constitute a limitation of the unit itself, for example, the first retrieving unit may also be described as a "unit for retrieving at least two internet protocol addresses".
The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: field Programmable Gate Arrays (FPGAs), application Specific Integrated Circuits (ASICs), application Specific Standard Products (ASSPs), systems on a chip (SOCs), complex Programmable Logic Devices (CPLDs), and the like.
In the context of this application, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
Other embodiments of the present application will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the application and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the application being indicated by the following claims.
It will be understood that the present application is not limited to the precise arrangements that have been described above and shown in the drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the application is limited only by the appended claims.

Claims (10)

1. A voice recognition method is applied to an intelligent household appliance system, the intelligent household appliance system comprises a plurality of controllable devices, and the voice recognition method comprises the following steps:
acquiring a device awakening instruction, selecting a first controllable device closest to a user from the plurality of controllable devices as an auxiliary monitoring device according to the device awakening instruction, and determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction;
receiving a first control instruction sent by a user and collected by first controllable equipment, and receiving a second control instruction sent by the user and collected by second controllable equipment;
and if the first control instruction is the same as the second control instruction, controlling the second controllable device to execute the second control instruction.
2. The speech recognition method of claim 1, wherein if the first control instruction is not the same as the second control instruction, the method further comprises:
judging whether the control semantics of the second control instruction are complete;
and if so, controlling the second controllable equipment to execute a second control instruction.
3. The speech recognition method of claim 2, wherein if the control semantics of the second control command are incomplete, the method further comprises:
analyzing the control semantics of the first control instruction, and completing the control semantics of the second control instruction by using the control semantics of the first control instruction;
sending an instruction confirmation request to a user by using second controllable equipment, wherein the instruction confirmation information comprises the control semantics of the second control instruction after completion;
and controlling the second controllable device to execute the supplemented second control instruction according to the received feedback information of the instruction confirmation request of the user.
4. The speech recognition method of claim 1, wherein if the first control instruction is not the same as the second control instruction, the method further comprises:
if the first control instruction and the second control instruction are conflict instructions, reminding a user to resend the control instruction by using the second controllable device;
and returning to the step of receiving the first control instruction sent by the user and collected by the first controllable device, and receiving the second control instruction sent by the user and collected by the second controllable device.
5. The speech recognition method according to any one of claims 1 to 4, wherein the selecting, according to the device wake-up instruction, a first controllable device closest to a user from the plurality of controllable devices as an auxiliary listening device, and determining, according to the device wake-up instruction, a second controllable device from the plurality of controllable devices as a device to be controlled, includes:
determining the signal-to-noise ratio of the equipment awakening instruction, and determining the position distribution of each controllable equipment in the current environment;
and according to the signal-to-noise ratio and the position distribution, selecting a first controllable device closest to a user from the plurality of controllable devices as an auxiliary monitoring device, and determining a second controllable device from the plurality of controllable devices as a device to be controlled.
6. The speech recognition method according to any one of claims 1-4, wherein after receiving the first control command sent by the user and collected by the first controllable device and receiving the second control command sent by the user and collected by the second controllable device, the method further comprises:
respectively determining the instruction word numbers corresponding to the first control instruction and the second control instruction;
and judging whether the first control instruction and the second control instruction are the same according to the instruction word number of the first control instruction and the instruction word number of the second control instruction.
7. A speech recognition apparatus, comprising:
the device comprises a determining module, a monitoring module and a control module, wherein the determining module is used for acquiring a device awakening instruction, selecting a first controllable device closest to a user from the plurality of controllable devices as an auxiliary monitoring device according to the device awakening instruction, and determining a second controllable device from the plurality of controllable devices as a device to be controlled according to the device awakening instruction;
the receiving control module is used for receiving a first control instruction which is acquired by first controllable equipment and sent by a user, and receiving a second control instruction which is acquired by second controllable equipment and sent by the user; and if the first control instruction is the same as the second control instruction, controlling the second controllable device to execute the second control instruction.
8. An electronic device, comprising: at least one processor and memory;
the memory stores computer-executable instructions;
the at least one processor executing the computer-executable instructions stored by the memory causes the at least one processor to perform the method of any one of claims 1-6.
9. A computer-readable storage medium having computer-executable instructions stored thereon which, when executed by a processor, implement the method of any one of claims 1-6.
10. A computer program product comprising a computer program, characterized in that the computer program realizes the method according to any of claims 1-6 when executed by a processor.
CN202111108287.6A 2021-09-22 2021-09-22 Voice recognition method and device, electronic equipment and storage medium Pending CN115938355A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111108287.6A CN115938355A (en) 2021-09-22 2021-09-22 Voice recognition method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111108287.6A CN115938355A (en) 2021-09-22 2021-09-22 Voice recognition method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115938355A true CN115938355A (en) 2023-04-07

Family

ID=86696404

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111108287.6A Pending CN115938355A (en) 2021-09-22 2021-09-22 Voice recognition method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115938355A (en)

Similar Documents

Publication Publication Date Title
US10331399B2 (en) Smart audio playback when connecting to an audio output system
CN103995716B (en) A kind of terminal applies startup method and terminal
US9635129B2 (en) Automatic application discovery, download, integration and launch
JP2019128938A (en) Lip reading based voice wakeup method, apparatus, arrangement and computer readable medium
US11270690B2 (en) Method and apparatus for waking up device
WO2018133307A1 (en) Method and terminal for implementing voice control
CN107731231B (en) Method for supporting multi-cloud-end voice service and storage device
JP7262532B2 (en) VOICE INTERACTIVE PROCESSING METHOD, APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PROGRAM
CN111402877A (en) Noise reduction method, device, equipment and medium based on vehicle-mounted multi-sound zone
US10950221B2 (en) Keyword confirmation method and apparatus
CN109036398A (en) Voice interactive method, device, equipment and storage medium
CN111968648B (en) Voice recognition method and device, readable medium and electronic equipment
JP2020038709A (en) Continuous conversation function with artificial intelligence device
EP3407096B1 (en) Method and device for determining descriptive information of precipitation trend, and readable storage medium
CN112242143B (en) Voice interaction method and device, terminal equipment and storage medium
CN111312243B (en) Equipment interaction method and device
CN111916079A (en) Voice response method, system, equipment and storage medium of electronic equipment
CN115938355A (en) Voice recognition method and device, electronic equipment and storage medium
CN111240634A (en) Sound box working mode adjusting method and device
CN113564865A (en) Remote control method and device for washing machine, electronic equipment and storage medium
CN115631514A (en) Palm vein fingerprint-based user identification method, device, equipment and medium
CN115439874A (en) Voice control method and device of equipment, equipment and storage medium
CN116072108A (en) Model generation method, voice recognition method, device, medium and equipment
JP2019091012A (en) Information recognition method and device
CN112114736A (en) Operation control method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination