CN111816192A - Voice equipment and control method, device and equipment thereof - Google Patents

Voice equipment and control method, device and equipment thereof Download PDF

Info

Publication number
CN111816192A
CN111816192A CN202010648687.5A CN202010648687A CN111816192A CN 111816192 A CN111816192 A CN 111816192A CN 202010648687 A CN202010648687 A CN 202010648687A CN 111816192 A CN111816192 A CN 111816192A
Authority
CN
China
Prior art keywords
wake
control instruction
free control
voice
voice data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010648687.5A
Other languages
Chinese (zh)
Inventor
侯雯珺
曹阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Original Assignee
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unisound Intelligent Technology Co Ltd, Xiamen Yunzhixin Intelligent Technology Co Ltd filed Critical Unisound Intelligent Technology Co Ltd
Priority to CN202010648687.5A priority Critical patent/CN111816192A/en
Publication of CN111816192A publication Critical patent/CN111816192A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • G10L17/24Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Selective Calling Equipment (AREA)

Abstract

The invention provides voice equipment and a control method, a device and equipment thereof, wherein the method comprises the following steps: monitoring a wake-up free control instruction for controlling the voice equipment; detecting whether the voiceprint associated with the wake-free control instruction is matched with the voiceprint of the order-giving person carried by the wake-free control instruction or not according to the incidence relation between the preset wake-free control instruction and the voiceprint; and if the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order applying personnel, controlling the voice equipment to respond to the wake-up-free control instruction. The invention realizes the verification of the identity corresponding to the monitored wake-up-free control instruction under the condition of setting a large number of wake-up-free control instructions, thereby reducing the false triggering rate caused by external factors and further improving the wake-up-free voice interaction performance of the voice equipment.

Description

Voice equipment and control method, device and equipment thereof
Technical Field
The invention relates to the technical field of voice equipment, in particular to voice equipment and a control method, a control device and control equipment thereof.
Background
In the existing voice equipment, when a user uses the voice equipment, the voice equipment needs to be awakened firstly, and then the user can issue a voice instruction. The method comprises the steps that the recognition function of the voice device is started through a customized command word to perform subsequent interaction, for example, "small degree" is used for feeding back "I am", and then the user continues to say "I want to watch a movie", wherein the "small degree" is a wake-up word, and after receiving an instruction and executing, the device closes the interaction and waits for the next wake-up.
The awakening of the awakening word is used as an important starting step, so that the interaction between a user and the voice equipment is complicated, and the experience of far-field voice interaction is influenced particularly when a plurality of instructions need to be issued. Therefore, in the prior art, a small number of wake-up free control commands, such as "pause playing" and "continue playing", are usually set to meet the needs of the user for quick control. That is, if the user sends the pause play command without waking up the audio device, the audio device can directly respond to the pause play command.
However, the probability of false wake-up is greatly increased due to the increase of the number of the wake-up-free control instructions, and in order to avoid the disturbance of the false wake-up of the voice device to the user, the prior art can only increase a very small number of wake-up-free control instructions and cannot cover the most basic common voice instructions of the user, so that the performance of the "wake-up-free voice interaction" of the voice device is poor.
Disclosure of Invention
In view of the above, an object of the present invention is to provide a voice device, a control method, an apparatus, a device and a storage medium thereof, so as to solve the problem of poor performance of "wake-up free voice interaction" of the voice device.
Based on the above object, the present invention provides a method for controlling a voice device, comprising:
monitoring a wake-up free control instruction; the wake-up-free control instruction is used for controlling voice equipment, and the wake-up-free control instruction carries voiceprints of an order enforcement person who sends the wake-up-free control instruction;
detecting whether the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement person according to the incidence relation between the preset wake-up-free control instruction and the voiceprint;
and if the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order applying personnel, controlling the voice equipment to respond to the wake-up-free control instruction.
Further, in the control method of the voice device, the monitoring of the wake-up exempting control instruction includes:
if the voice data of the commander is collected, converting the voice data into text information;
extracting key information in the text information;
detecting whether a stored wake-up-free control instruction in the incidence relation between a preset wake-up-free control instruction and the voiceprint is matched with the key information or not;
and if the stored wake-up-free control instruction is matched with the key information, determining to monitor the wake-up-free control instruction.
Further, in the method for controlling a voice device, after controlling the voice device to respond to the wake-up exempting control instruction, the method further includes:
detecting whether historical voice data of the command enforcement personnel failing to control the voice equipment in a wake-up prevention mode exists in a historical time period; the termination time of the historical time period is the first moment when the voice data of the order applying personnel are collected; the starting time of the historical time period is a second time corresponding to the difference value between the first time and a preset time;
if historical voice data of the order applying personnel failing to wake up the voice equipment are stored in a historical time period, determining the intention of the order applying personnel according to the historical voice data;
judging whether the intention is matched with the wake-up-free control instruction;
and if the intention is matched with the wake-free control instruction, associating the historical voice data with the wake-free control instruction.
Further, in the control method of a voice device, the associating the historical voice data with the wake-up exempt control instruction includes:
outputting associated prompt information so as to receive feedback information of the order applying personnel for the associated prompt information;
if the feedback information represents that association is forbidden, forbidding to execute the associated action of the historical voice data and the wake-up-free control instruction;
and if the feedback information indicates that the correlation is allowed, executing the correlation action of the historical voice data and the wake-up-free control instruction.
The present invention also provides a control device of a voice device, comprising:
the monitoring module is used for monitoring the wake-up-free control instruction; the wake-up-free control instruction is used for controlling voice equipment, and the wake-up-free control instruction carries voiceprints of an order enforcement person who sends the wake-up-free control instruction;
the detection module is used for detecting whether the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement personnel according to the association relation between the preset wake-up-free control instruction and the voiceprint;
and the control module is used for controlling the voice equipment to respond to the wake-up-free control instruction if the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement personnel.
Further, in the control apparatus of the voice device, the monitoring module is specifically configured to:
if the voice data of the commander is collected, converting the voice data into text information;
extracting key information in the text information;
detecting whether a stored wake-up-free control instruction in the incidence relation between a preset wake-up-free control instruction and the voiceprint is matched with the key information or not;
and if the stored wake-up-free control instruction is matched with the key information, determining to monitor the wake-up-free control instruction.
Further, in the control device of the voice device, the control module is further configured to:
detecting whether historical voice data of the command enforcement personnel failing to control the voice equipment in a wake-up prevention mode exists in a historical time period; the termination time of the historical time period is the first moment when the voice data of the order applying personnel are collected; the starting time of the historical time period is a second time corresponding to the difference value between the first time and a preset time;
if historical voice data of the order applying personnel failing to wake up the voice equipment are stored in a historical time period, determining the intention of the order applying personnel according to the historical voice data;
judging whether the intention is matched with the wake-up-free control instruction;
and if the intention is matched with the wake-free control instruction, associating the historical voice data with the wake-free control instruction.
Further, in the control device of the voice device, the control module is further configured to:
outputting associated prompt information so as to receive feedback information of the order applying personnel for the associated prompt information;
if the feedback information represents that association is forbidden, forbidding to execute the associated action of the historical voice data and the wake-up-free control instruction;
and if the feedback information indicates that the correlation is allowed, executing the correlation action of the historical voice data and the wake-up-free control instruction.
The invention also provides a control device of a voice device, which comprises a memory, a controller and a computer program stored on the memory and capable of running on the controller, and is characterized in that the controller realizes the method as described in any one of the above items when executing the program.
The invention also provides voice equipment and control equipment provided with the voice equipment.
From the above, the voice device, the control method, the control device and the control equipment thereof provided by the invention monitor the wake-up free control instruction for controlling the voice device; whether the voiceprint associated with the monitored wake-up-free control instruction is matched with the voiceprint of an operator carried by the wake-up-free control instruction of the voice equipment is detected according to the incidence relation between the preset wake-up-free control instruction and the voiceprint, so that the identity corresponding to the monitored wake-up-free control instruction is verified under the condition that a large number of wake-up-free control instructions are set, the voice equipment is controlled to respond to the monitored wake-up-free control instruction when the monitored voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the operator, the false triggering rate caused by external factors is reduced, and the wake-up-free voice interaction performance of the voice equipment is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
FIG. 1 is a flowchart of a first embodiment of a method for controlling a speech device according to the present invention;
FIG. 2 is a flowchart of a second embodiment of a method for controlling a speech device according to the present invention;
FIG. 3 is a schematic structural diagram of a control apparatus of a speech device according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an embodiment of a control device of the speech device of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings.
It is to be noted that technical terms or scientific terms used in the embodiments of the present invention should have the ordinary meanings as understood by those having ordinary skill in the art to which the present disclosure belongs, unless otherwise defined. The use of "first," "second," and similar terms in this disclosure is not intended to indicate any order, quantity, or importance, but rather is used to distinguish one element from another. The word "comprising" or "comprises", and the like, means that the element or item listed before the word covers the element or item listed after the word and its equivalents, but does not exclude other elements or items. The terms "connected" or "coupled" and the like are not restricted to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "upper", "lower", "left", "right", and the like are used merely to indicate relative positional relationships, and when the absolute position of the object being described is changed, the relative positional relationships may also be changed accordingly.
In the prior art, due to the increase of the number of the wake-up-free control instructions, disturbance may be brought to a user due to false triggering of external factors (such as noise, music, voice of a television, and the like), so in order to solve the technical problem, the invention can bind the recorded wake-up-free control instructions and voiceprints of corresponding users when the wake-up-free control instructions are recorded, thereby obtaining the association relationship between the preset wake-up-free control instructions and the voiceprints, and thus, the voiceprints of the users are used for verifying the identity information of the wake-up-free control instructions received by the voice equipment, so as to reduce the false triggering rate caused by the external factors. In particular, reference may be made to the following examples:
fig. 1 is a flowchart of a first embodiment of a method for controlling a speech device according to the present invention, and as shown in fig. 1, the method for controlling a speech device according to this embodiment may specifically include the following steps:
100. monitoring a wake-up free control instruction;
specifically, a microphone array of the voice device can be used for collecting voice data sent by the commander, extracting voiceprints of the commander after the voice data of the commander is collected, and converting the voice data of the commander into text information; extracting key information in the text information; such as keywords, etc. After extracting the key information, whether the key information is matched with a stored wake-up-free control instruction in the incidence relation between the preset wake-up-free control instruction and the voiceprint or not can be detected; and if the key information is matched with the stored wake-up-free control instruction, determining to monitor the wake-up-free control instruction of the voice equipment, wherein the wake-up-free control instruction carries the voiceprint of an order administrator sending the wake-up-free control instruction. And if the key information is not matched with the stored wake-up-free control instruction, determining that the wake-up-free control instruction of the voice equipment is not monitored.
101. Detecting whether the voiceprint associated with the wake-up-free control instruction of the voice equipment is matched with the voiceprint of the order enforcement personnel according to the association relation between the preset wake-up-free control instruction and the voiceprint;
in this embodiment, after the wake-up-free control instruction is monitored, the voiceprint associated with the monitored wake-up-free control instruction and the voiceprint of the order applying person can be compared according to the association relationship between the preset wake-up-free control instruction and the voiceprint, so as to detect whether the voiceprint associated with the monitored wake-up-free control instruction is matched with the voiceprint of the order applying person.
102. And if the voiceprint associated with the monitored wake-up-free control instruction is matched with the voiceprint of the order enforcement personnel, controlling the voice equipment to respond to the monitored wake-up-free control instruction.
In this embodiment, if it is detected that the voiceprint associated with the monitored wake-up-free control instruction matches the voiceprint of the command administrator, it is indicated that the monitored wake-up-free control instruction is triggered by a specific user, and is not triggered by an external factor (such as noise, music, voice of a television, and the like) by mistake, and at this time, the voice device may be controlled to respond to the monitored wake-up-free control instruction and execute an action corresponding to the monitored wake-up-free control instruction. Therefore, under the condition of keeping a low false awakening rate, a plurality of awakening-free instructions, such as 'playing music', 'turning on light', 'air conditioning refrigeration', 'air conditioning 25 DEG', 'weather forecast' and the like, can be added, the equipment does not need to be awakened first and then instructions are given, the real 'awakening-free voice interaction' is achieved, and great convenience is brought to daily use of users.
The control method of the voice equipment of the invention monitors the wake-up-free control instruction for controlling the voice equipment; whether the voiceprint associated with the monitored wake-up-free control instruction is matched with the voiceprint of an operator carried by the wake-up-free control instruction of the voice equipment is detected according to the incidence relation between the preset wake-up-free control instruction and the voiceprint, so that the identity corresponding to the monitored wake-up-free control instruction is verified under the condition that a large number of wake-up-free control instructions are set, the voice equipment is controlled to respond to the monitored wake-up-free control instruction when the monitored voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the operator, the false triggering rate caused by external factors is reduced, and the wake-up-free voice interaction performance of the voice equipment is improved.
In practical applications, in order to quickly respond to the wake-up-free control instruction, when the wake-up-free control instruction is monitored, the key information corresponding to the voice data of the order-executing person is usually compared with the stored wake-up-free control instruction in a text manner, for example, the wake-up-free control instruction is "pause play", and at this time, it is recognized that the voice data of the order-executing person has key information such as "pause", "stop play", and the like, and the wake-up-free control instruction is monitored. When the same order administrator needs to start the wake-up-free control instruction, different expression modes can be adopted for various reasons, so that other key information can be acquired, and the same order administrator cannot think that the wake-up-free control instruction is monitored. For example, when the instructor does not want to play music, the instructor may send "listen to music so much", and at this time, key information such as "pause", "stop playing" cannot be extracted, and thus the audio device does not operate. The operator needs to send out the voice data of 'pause playing' again, which brings bad experience to the user. If an operator wants to use different voice data as the same wake-up-free control instruction, the operator usually needs to set the command by the user, and the process is complicated and brings trouble to the user.
Specifically, fig. 2 is a flowchart of a second embodiment of the method for controlling a speech device in the present invention, and as shown in fig. 2, the method for controlling a speech device in this embodiment may specifically include the following steps:
200. monitoring a wake-up free control instruction;
specifically, a microphone array of the voice device can be used for collecting voice data sent by the commander, extracting voiceprints of the commander after the voice data of the commander is collected, and converting the voice data of the commander into text information; extracting key information in the text information; such as keywords, etc. After extracting the key information, whether the key information is matched with a stored wake-up-free control instruction in the incidence relation between the preset wake-up-free control instruction and the voiceprint or not can be detected; and if the key information is matched with the stored wake-up-free control instruction, determining to monitor the wake-up-free control instruction of the voice equipment, wherein the wake-up-free control instruction carries the voiceprint of an order administrator sending the wake-up-free control instruction. If the key information is not matched with the stored wake-up-free control instruction, the wake-up-free control instruction of the voice equipment is determined not to be monitored, however, the collected voice data of the order applying personnel can be temporarily stored, so that whether the collected voice data of the order applying personnel are used as the wake-up-free control instruction for controlling the voice equipment or not can be determined subsequently.
201. Detecting whether the voiceprint associated with the wake-up-free control instruction of the voice equipment is matched with the voiceprint of the order enforcement personnel according to the association relation between the preset wake-up-free control instruction and the voiceprint; if yes, go to step 202, if no, end;
202. controlling the voice equipment to respond to the monitored wake-up-free control instruction;
if the voiceprint associated with the monitored wake-up-free control instruction is matched with the voiceprint of the order-giving person, the voice equipment can be controlled to respond to the monitored wake-up-free control instruction.
203. Detecting whether historical voice data of a command operator failing to prevent the voice equipment from being awakened exist in a historical time period; if yes, executing step 204, otherwise, ending;
in this embodiment, after the voice device is controlled to respond to the monitored wake-up avoidance control instruction, a plurality of voice data collected by the voice device in the historical time period may be obtained, and whether historical voice data that the voice device wake-up avoidance control fails by the operator exists in the historical time period is detected according to the voiceprint corresponding to each voice data and the voiceprint of the operator, for example, the collected historical voice data of the operator exists in the historical time period, but the historical voice data does not directly control the operation of the voice device, which indicates that the historical voice data that the voice device wake-up avoidance control fails by the operator exists in the historical time period. If the instructor gives "listen to music and do so", at this time, the voice device does not perform the action of suspending playing, that is, the instructor fails to control the voice device, step 204 is performed, otherwise, the operation is ended.
In this embodiment, the termination time of the historical time period is preferably a first time when the voice data of the commander is collected; the starting time of the historical time period is preferably a second time corresponding to a difference between the first time and the preset time period, wherein the preset time period is not suitable to be large, and is preferably 3s, because when the operator controls the voice device to execute the action corresponding to the wake-up free control instruction, if the operator fails to control, the operator will immediately perform the next control.
204. Determining the intention of the order-giving person according to historical voice data;
if historical voice data that the instructor fails to wake up the voice equipment in the historical time period exists, the historical voice data can be imported into a preset intention training model for recognition, so that the intention of the instructor can be obtained, for example, the instructor sends out ' how much music is heard, so that the intention of the instructor is determined to be ' pause playing '.
205. Judging whether the intention of the order-giving personnel is matched with the wake-up-free control instruction or not; if yes, go to step 206, otherwise, end;
specifically, the command applying person controls the voice device through the "pause playing" wake-up free control instruction, so that after the intention of the command applying person is obtained according to the historical voice data of the command applying person, whether the intention of the command applying person matches with the wake-up free control instruction for controlling the voice device can be judged, if the intention of the command applying person matches with the wake-up free control instruction is judged, step 206 is executed, and if the intention of the command applying person does not match with the wake-up free control instruction is judged, the operation is ended. For example, if the command engineer gives "how to listen to music" it may be determined that the command engineer intends to "pause playing" and match the monitored command for the command engineer to avoid waking up, step 206 is performed, otherwise, if the command engineer gives "play other music" it is determined that the command engineer intends to "switch music" and does not match the monitored command for the command engineer to avoid waking up, and then the process is terminated.
206. Historical voice data is associated with the wake-free control instruction.
If it is determined that the intention of the operator is matched with the wake-up avoidance control instruction, it indicates that the operator may use other voice data to express the correct wake-up avoidance control instruction before sending the correct wake-up avoidance control instruction, but the control fails because the other voice data is not stored, and therefore, in this embodiment, the historical voice data may be associated with the wake-up avoidance control instruction, so that in the subsequent process of using the voice device, if the voice data identical to the historical voice data is monitored, the wake-up avoidance control instruction may also be monitored. That is, if it is monitored that the commander sends "how to listen to music" and is worried about, it may be determined that "play is paused" is monitored. Therefore, the order applying personnel do not need to set various voice data aiming at the same wake-up-free control instruction, after the order applying personnel control the voice equipment by using the correct wake-up-free control instruction, the voice control equipment judges that the temporarily stored historical voice data of the order applying personnel are also the intention of reaching the wake-up-free control instruction, and the historical voice data of the order applying personnel and the wake-up-free control instruction are automatically associated. In addition, after the order applying personnel controls the voice equipment by using the correct wake-up-free control instruction, historical voice data of the order applying personnel is associated with the wake-up-free control instruction instead of performing intention judgment before the order applying personnel controls the voice equipment by using the correct wake-up-free control instruction, so that the situation that when the order applying personnel sends the wake-up-free control instruction again, the voice equipment cannot respond to the wake-up-free control instruction in time due to intention judgment is prevented, and the user experience is influenced.
In a specific implementation process, an intention judgment error may occur, and at this time, if the historical voice data is associated with the wake-free control instruction, the probability that the voice device is woken by mistake in the later period may be increased, so in this embodiment, in the step 206 of associating the historical voice data with the wake-free control instruction, an association prompt message including an intention of the operator may be output, so that the operator determines whether the association is possible according to the association prompt message, and sends a feedback message to the voice device, so that after receiving the feedback message of the operator for the association prompt message, if the obtained feedback message indicates that association is prohibited, it is described that the obtained intention message may be erroneous, at this time, the execution of the association action of the historical voice data and the wake-free control instruction may be prohibited; if the obtained feedback information indicates that the association is allowed, the obtained intention information is accurate, and at this time, the operation of associating the historical voice data with the wake-up free control command can be executed.
It should be noted that the method of the embodiment of the present invention may be executed by a single device, such as a computer or a server. The method of the embodiment can also be applied to a distributed scene and completed by the mutual cooperation of a plurality of devices. In the case of such a distributed scenario, one device of the multiple devices may only perform one or more steps of the method according to the embodiment of the present invention, and the multiple devices interact with each other to complete the method.
Fig. 3 is a schematic structural diagram of a control device of a voice apparatus according to an embodiment of the present invention, and as shown in fig. 2, the passing device of the embodiment includes a monitoring module 30, a detecting module 31, and a control module 32.
A monitoring module 30, configured to monitor a wake-up exempting control instruction; the wake-up-free control instruction is used for controlling the voice equipment and carrying voiceprints of the order-giving personnel sending the wake-up-free control instruction;
specifically, the monitoring module 30 may convert the voice data into text information if the voice data of the commander is collected; extracting key information in the text information; detecting whether a stored wake-up-free control instruction in the incidence relation between a preset wake-up-free control instruction and the voiceprint is matched with key information or not; and if the stored wake-up-free control instruction is matched with the key information, determining that the wake-up-free control instruction is monitored.
The detection module 31 is configured to detect whether a voiceprint associated with the wake-up-free control instruction matches a voiceprint of an order enforcement person according to an association relationship between a preset wake-up-free control instruction and the voiceprint;
and the control module 32 is used for controlling the voice equipment to respond to the wake-up-free control instruction if the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement personnel.
The control device of the voice equipment monitors the wake-up-free control instruction for controlling the voice equipment; whether the voiceprint associated with the monitored wake-up-free control instruction is matched with the voiceprint of an operator carried by the wake-up-free control instruction of the voice equipment is detected according to the incidence relation between the preset wake-up-free control instruction and the voiceprint, so that the identity corresponding to the monitored wake-up-free control instruction is verified under the condition that a large number of wake-up-free control instructions are set, the voice equipment is controlled to respond to the monitored wake-up-free control instruction when the monitored voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the operator, the false triggering rate caused by external factors is reduced, and the wake-up-free voice interaction performance of the voice equipment is improved.
In one specific implementation, the control module 32 is further configured to:
detecting whether historical voice data of a command operator failing to prevent the voice equipment from being awakened exist in a historical time period; the termination time of the historical time period is the first moment when the voice data of the commander is collected; the starting time of the historical time period is a second time corresponding to the difference value between the first time and the preset time;
if historical voice data that the command applying personnel fail to prevent the voice equipment from being awakened exist in the historical time period, determining the intention of the command applying personnel according to the historical voice data;
judging whether the intention is matched with the wake-up free control instruction;
if the intent matches the wake-free control directive, the historical speech data is associated with the wake-free control directive.
Further, in the above embodiment, the control module 32 is further configured to:
outputting the associated prompt information so as to receive feedback information of the instructor aiming at the associated prompt information;
if the feedback information indicates that the association is forbidden, forbidding to execute the associated action of the historical voice data and the wake-up-free control instruction;
and if the feedback information indicates that the association is allowed, executing the association action of the historical voice data and the wake-up-free control instruction.
The apparatus of the foregoing embodiment is used to implement the corresponding method in the foregoing embodiment, and has the beneficial effects of the corresponding method embodiment, which are not described herein again.
Fig. 4 is a schematic structural diagram of an embodiment of a control device of a speech device of the present invention, and as shown in fig. 3, the control device of the speech device of this embodiment may include: a processor 1010 and a memory 1020. The control devices of the voice device may also include input/output interface 1030, communication interface 1040, and bus 1050, as will be appreciated by those skilled in the art. Wherein the processor 1010, memory 1020, input/output interface 1030, and communication interface 1040 are communicatively coupled to each other within the device via bus 1050.
The processor 1010 may be implemented by a general-purpose CPU (Central Processing Unit), a microprocessor, an Application Specific Integrated Circuit (ASIC), or one or more Integrated circuits, and is configured to execute related programs to implement the technical solutions provided in the embodiments of the present disclosure.
The Memory 1020 may be implemented in the form of a ROM (Read Only Memory), a RAM (Random access Memory), a static storage device, a dynamic storage device, or the like. The memory 1020 may store an operating system and other application programs, and when the technical solution provided by the embodiments of the present specification is implemented by software or firmware, the relevant program codes are stored in the memory 1020 and called to be executed by the processor 1010.
The input/output interface 1030 is used for connecting an input/output module to input and output information. The i/o module may be configured as a component in a device (not shown) or may be external to the device to provide a corresponding function. The input devices may include a keyboard, a mouse, a touch screen, a microphone, various sensors, etc., and the output devices may include a display, a speaker, a vibrator, an indicator light, etc.
The communication interface 1040 is used for connecting a communication module (not shown in the drawings) to implement communication interaction between the present apparatus and other apparatuses. The communication module can realize communication in a wired mode (such as USB, network cable and the like) and also can realize communication in a wireless mode (such as mobile network, WIFI, Bluetooth and the like).
Bus 1050 includes a path that transfers information between various components of the device, such as processor 1010, memory 1020, input/output interface 1030, and communication interface 1040.
It should be noted that although the above-mentioned device only shows the processor 1010, the memory 1020, the input/output interface 1030, the communication interface 1040 and the bus 1050, in a specific implementation, the device may also include other components necessary for normal operation. In addition, those skilled in the art will appreciate that the above-described apparatus may also include only those components necessary to implement the embodiments of the present description, and not necessarily all of the components shown in the figures.
The invention also provides voice equipment which is provided with the control equipment of the voice equipment of the embodiment.
The present invention also provides a storage medium storing computer instructions for causing the computer to execute the control method of the voice device of the above-described embodiment.
Computer-readable media of the present embodiments, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device.
Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, is limited to these examples; within the idea of the invention, also features in the above embodiments or in different embodiments may be combined, steps may be implemented in any order, and there are many other variations of the different aspects of the invention as described above, which are not provided in detail for the sake of brevity.
In addition, well known power/ground connections to Integrated Circuit (IC) chips and other components may or may not be shown within the provided figures for simplicity of illustration and discussion, and so as not to obscure the invention. Furthermore, devices may be shown in block diagram form in order to avoid obscuring the invention, and also in view of the fact that specifics with respect to implementation of such block diagram devices are highly dependent upon the platform within which the present invention is to be implemented (i.e., specifics should be well within purview of one skilled in the art). Where specific details (e.g., circuits) are set forth in order to describe example embodiments of the invention, it should be apparent to one skilled in the art that the invention can be practiced without, or with variation of, these specific details. Accordingly, the description is to be regarded as illustrative instead of restrictive.
While the present invention has been described in conjunction with specific embodiments thereof, many alternatives, modifications, and variations of these embodiments will be apparent to those of ordinary skill in the art in light of the foregoing description. For example, other memory architectures (e.g., dynamic ram (dram)) may use the discussed embodiments.
The embodiments of the invention are intended to embrace all such alternatives, modifications and variances that fall within the broad scope of the appended claims. Therefore, any omissions, modifications, substitutions, improvements and the like that may be made without departing from the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims (10)

1. A method for controlling a speech device, comprising:
monitoring a wake-up free control instruction; the wake-up-free control instruction is used for controlling voice equipment, and the wake-up-free control instruction carries voiceprints of an order enforcement person who sends the wake-up-free control instruction;
detecting whether the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement person according to the incidence relation between the preset wake-up-free control instruction and the voiceprint;
and if the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order applying personnel, controlling the voice equipment to respond to the wake-up-free control instruction.
2. The method for controlling a voice device according to claim 1, wherein the monitoring the wake-up exempt control command includes:
if the voice data of the commander is collected, converting the voice data into text information;
extracting key information in the text information;
detecting whether a stored wake-up-free control instruction in the incidence relation between a preset wake-up-free control instruction and the voiceprint is matched with the key information or not;
and if the stored wake-up-free control instruction is matched with the key information, determining to monitor the wake-up-free control instruction.
3. The method for controlling the voice device according to claim 1, wherein after controlling the voice device to respond to the wake-up exempt control command, the method further comprises:
detecting whether historical voice data of the command enforcement personnel failing to control the voice equipment in a wake-up prevention mode exists in a historical time period; the termination time of the historical time period is the first moment when the voice data of the order applying personnel are collected; the starting time of the historical time period is a second time corresponding to the difference value between the first time and a preset time;
if historical voice data of the order applying personnel failing to wake up the voice equipment are stored in a historical time period, determining the intention of the order applying personnel according to the historical voice data;
judging whether the intention is matched with the wake-up-free control instruction;
and if the intention is matched with the wake-free control instruction, associating the historical voice data with the wake-free control instruction.
4. The method for controlling the voice device according to claim 3, wherein the associating the historical voice data with the wake-free control instruction comprises:
outputting associated prompt information so as to receive feedback information of the order applying personnel for the associated prompt information;
if the feedback information represents that association is forbidden, forbidding to execute the associated action of the historical voice data and the wake-up-free control instruction;
and if the feedback information indicates that the correlation is allowed, executing the correlation action of the historical voice data and the wake-up-free control instruction.
5. A control apparatus of a voice device, characterized by comprising:
the monitoring module is used for monitoring the wake-up-free control instruction; the wake-up-free control instruction is used for controlling voice equipment, and the wake-up-free control instruction carries voiceprints of an order enforcement person who sends the wake-up-free control instruction;
the detection module is used for detecting whether the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement personnel according to the association relation between the preset wake-up-free control instruction and the voiceprint;
and the control module is used for controlling the voice equipment to respond to the wake-up-free control instruction if the voiceprint associated with the wake-up-free control instruction is matched with the voiceprint of the order enforcement personnel.
6. The control device of the voice device according to claim 5, wherein the monitoring module is specifically configured to:
if the voice data of the commander is collected, converting the voice data into text information;
extracting key information in the text information;
detecting whether a stored wake-up-free control instruction in the incidence relation between a preset wake-up-free control instruction and the voiceprint is matched with the key information or not;
and if the stored wake-up-free control instruction is matched with the key information, determining to monitor the wake-up-free control instruction.
7. The control device of the voice apparatus according to claim 5, wherein the control module is further configured to:
detecting whether historical voice data of the command enforcement personnel failing to control the voice equipment in a wake-up prevention mode exists in a historical time period; the termination time of the historical time period is the first moment when the voice data of the order applying personnel are collected; the starting time of the historical time period is a second time corresponding to the difference value between the first time and a preset time;
if historical voice data of the order applying personnel failing to wake up the voice equipment are stored in a historical time period, determining the intention of the order applying personnel according to the historical voice data;
judging whether the intention is matched with the wake-up-free control instruction;
and if the intention is matched with the wake-free control instruction, associating the historical voice data with the wake-free control instruction.
8. The control device of the voice apparatus according to claim 5, wherein the control module is further configured to:
outputting associated prompt information so as to receive feedback information of the order applying personnel for the associated prompt information;
if the feedback information represents that association is forbidden, forbidding to execute the associated action of the historical voice data and the wake-up-free control instruction;
and if the feedback information indicates that the correlation is allowed, executing the correlation action of the historical voice data and the wake-up-free control instruction.
9. A control device for a speech device comprising a memory, a controller and a computer program stored on the memory and executable on the controller, characterized in that the controller implements the method according to any of claims 1 to 4 when executing the program.
10. A speech device characterized by being provided with the control device of the speech device of claim 9.
CN202010648687.5A 2020-07-07 2020-07-07 Voice equipment and control method, device and equipment thereof Pending CN111816192A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010648687.5A CN111816192A (en) 2020-07-07 2020-07-07 Voice equipment and control method, device and equipment thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010648687.5A CN111816192A (en) 2020-07-07 2020-07-07 Voice equipment and control method, device and equipment thereof

Publications (1)

Publication Number Publication Date
CN111816192A true CN111816192A (en) 2020-10-23

Family

ID=72841890

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010648687.5A Pending CN111816192A (en) 2020-07-07 2020-07-07 Voice equipment and control method, device and equipment thereof

Country Status (1)

Country Link
CN (1) CN111816192A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112201246A (en) * 2020-11-19 2021-01-08 深圳市欧瑞博科技股份有限公司 Intelligent control method and device based on voice, electronic equipment and storage medium
WO2023273321A1 (en) * 2021-06-29 2023-01-05 荣耀终端有限公司 Voice control method and electronic device
CN116074150A (en) * 2023-03-02 2023-05-05 广东浩博特科技股份有限公司 Switch control method and device for intelligent home and intelligent home
WO2024051611A1 (en) * 2022-09-05 2024-03-14 华为技术有限公司 Human-machine interaction method and related apparatus

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107665710A (en) * 2016-07-27 2018-02-06 上海博泰悦臻网络技术服务有限公司 Mobile terminal sound data processing method and device
US10079015B1 (en) * 2016-12-06 2018-09-18 Amazon Technologies, Inc. Multi-layer keyword detection
US20180301151A1 (en) * 2017-04-12 2018-10-18 Soundhound, Inc. Managing agent engagement in a man-machine dialog
CN109410952A (en) * 2018-10-26 2019-03-01 北京蓦然认知科技有限公司 A kind of voice awakening method, apparatus and system
CN110246498A (en) * 2019-07-15 2019-09-17 广东美的制冷设备有限公司 Method of speech processing, device and household appliance
CN111354360A (en) * 2020-03-17 2020-06-30 北京百度网讯科技有限公司 Voice interaction processing method and device and electronic equipment
CN113393834A (en) * 2020-03-11 2021-09-14 阿里巴巴集团控股有限公司 Control method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107665710A (en) * 2016-07-27 2018-02-06 上海博泰悦臻网络技术服务有限公司 Mobile terminal sound data processing method and device
US10079015B1 (en) * 2016-12-06 2018-09-18 Amazon Technologies, Inc. Multi-layer keyword detection
US20180301151A1 (en) * 2017-04-12 2018-10-18 Soundhound, Inc. Managing agent engagement in a man-machine dialog
CN108847226A (en) * 2017-04-12 2018-11-20 声音猎手公司 The agency managed in human-computer dialogue participates in
CN109410952A (en) * 2018-10-26 2019-03-01 北京蓦然认知科技有限公司 A kind of voice awakening method, apparatus and system
CN110246498A (en) * 2019-07-15 2019-09-17 广东美的制冷设备有限公司 Method of speech processing, device and household appliance
CN113393834A (en) * 2020-03-11 2021-09-14 阿里巴巴集团控股有限公司 Control method and device
CN111354360A (en) * 2020-03-17 2020-06-30 北京百度网讯科技有限公司 Voice interaction processing method and device and electronic equipment

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112201246A (en) * 2020-11-19 2021-01-08 深圳市欧瑞博科技股份有限公司 Intelligent control method and device based on voice, electronic equipment and storage medium
CN112201246B (en) * 2020-11-19 2023-11-28 深圳市欧瑞博科技股份有限公司 Intelligent control method and device based on voice, electronic equipment and storage medium
WO2023273321A1 (en) * 2021-06-29 2023-01-05 荣耀终端有限公司 Voice control method and electronic device
WO2024051611A1 (en) * 2022-09-05 2024-03-14 华为技术有限公司 Human-machine interaction method and related apparatus
CN116074150A (en) * 2023-03-02 2023-05-05 广东浩博特科技股份有限公司 Switch control method and device for intelligent home and intelligent home
CN116074150B (en) * 2023-03-02 2023-06-09 广东浩博特科技股份有限公司 Switch control method and device for intelligent home and intelligent home

Similar Documents

Publication Publication Date Title
CN111816192A (en) Voice equipment and control method, device and equipment thereof
US11011172B2 (en) Electronic device and voice recognition method thereof
US11087769B1 (en) User authentication for voice-input devices
US9966076B2 (en) Voice control system and method
KR102444061B1 (en) Electronic device and method for recognizing voice of speech
US11256793B2 (en) Method and device for identity authentication
CN111670471B (en) Learning offline voice commands based on use of online voice commands
US20170193212A1 (en) Screen Interface Unlocking Method And Screen Interface Unlocking Device
CN105229724A (en) Mixed performance convergent-divergent or speech recognition
US20170311261A1 (en) Smart listening modes supporting quasi always-on listening
US20180285068A1 (en) Processing method of audio control and electronic device thereof
US20140189338A1 (en) Electronic device and method for detecting booting time period for electronic device
CN109032345B (en) Equipment control method, device, equipment, server and storage medium
CN110942768A (en) Equipment wake-up test method and device, mobile terminal and storage medium
US20150179184A1 (en) Compensating For Identifiable Background Content In A Speech Recognition Device
US9450554B2 (en) Electronic device and method for adjusting volume
US10950221B2 (en) Keyword confirmation method and apparatus
US20190362709A1 (en) Offline Voice Enrollment
CN111816178A (en) Voice equipment control method, device and equipment
CN111341315A (en) Voice control method, device, computer equipment and storage medium
KR102501083B1 (en) Method for voice detection and electronic device using the same
US9756141B2 (en) Media content consumption analytics
CN111612482A (en) Conversation management method, device and equipment
CN112017663A (en) Voice generalization method and device and computer storage medium
US20200410988A1 (en) Information processing device, information processing system, and information processing method, and program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20201023