WO2018090252A1 - Voice instruction recognition method for robot, and related robot device - Google Patents

Voice instruction recognition method for robot, and related robot device Download PDF

Info

Publication number
WO2018090252A1
WO2018090252A1 PCT/CN2016/106118 CN2016106118W WO2018090252A1 WO 2018090252 A1 WO2018090252 A1 WO 2018090252A1 CN 2016106118 W CN2016106118 W CN 2016106118W WO 2018090252 A1 WO2018090252 A1 WO 2018090252A1
Authority
WO
WIPO (PCT)
Prior art keywords
robot
accuracy
voice command
threshold
preset
Prior art date
Application number
PCT/CN2016/106118
Other languages
French (fr)
Chinese (zh)
Inventor
骆磊
Original Assignee
深圳达闼科技控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳达闼科技控股有限公司 filed Critical 深圳达闼科技控股有限公司
Priority to CN201680002660.0A priority Critical patent/CN106796790B/en
Priority to PCT/CN2016/106118 priority patent/WO2018090252A1/en
Publication of WO2018090252A1 publication Critical patent/WO2018090252A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems

Definitions

  • Embodiments of the present invention relate to the field of robots, and in particular, to a method for recognizing a voice command of a robot and related robot devices.
  • the technical problem to be solved by the embodiments of the present invention is to provide a method for recognizing a voice command of a robot and a related robot device, which can better coordinate the accuracy and working efficiency of voice command recognition.
  • a technical solution adopted by the embodiment of the present invention is to provide a method for recognizing a voice command of a robot, comprising: after detecting that a voice is sent by a user, and the current voice command recognition accuracy is lower than a preset When the accuracy threshold is adjusted, the working state of the robot is adjusted until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • the method further includes: obtaining a current noise level; determining, according to the current noise level, a current voice instruction recognition accuracy rate according to a correspondence between the noise level and the recognition accuracy rate.
  • the preset accuracy threshold corresponds to the priority of the currently executing task of the robot; the higher the priority of executing the task, the lower the corresponding accuracy threshold.
  • the adjusting the working state of the robot until the adjusted accuracy of the adjusted voice command of the robot reaches a preset accuracy threshold includes: determining the work of the robot according to the difference between the current voice command recognition accuracy rate and the preset accuracy rate threshold value.
  • the adjustment range of the state; the movement of the robot is adjusted according to the determined adjustment range, so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • the method further includes: when the adjusted accuracy of the voice command recognition of the robot does not reach the preset accuracy threshold, the user is notified that the current environmental noise is large or close to the user.
  • the user when the accuracy of the adjusted voice command recognition of the robot does not reach the preset accuracy threshold, the user is informed that the current environmental noise is large or close to the user, which means that the accuracy of the voice command recognition after the adjustment of the robot is not
  • the preset accuracy threshold is reached and the priority of the currently executed task is lower than the preset priority, the user is informed that the current environmental noise is large or close to the user.
  • adjusting the working state of the robot until the adjusted accuracy of the voice command recognition of the robot reaches a preset accuracy threshold includes: adjusting the motion of the robot according to the preset adjustment amplitude, until the robot corrects the adjusted voice command. The rate reaches the preset accuracy threshold.
  • the method further includes: when the robot stops the action after adjusting the working state, when the voice instruction recognition accuracy rate has not reached the preset accuracy rate threshold, the user is notified that the current environmental noise is large or close to the user.
  • adjusting the working state of the robot includes one or any of the following: placing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear.
  • a robot voice instruction recognition apparatus including: an adjustment module, after the user issues a voice, and the current voice instruction recognition accuracy is lower than Adjust the preset accuracy threshold The working state of the robot until the robot adjusts the accuracy of the voice command recognition to a preset accuracy threshold.
  • the device further includes: an obtaining module, configured to acquire a current noise level; and an accuracy determining module configured to determine, according to a current noise level, a current voice command recognition accuracy rate according to a correspondence between the noise level and the recognition accuracy rate.
  • the adjustment module includes: an amplitude determining unit, configured to determine, according to a difference between a current voice command recognition accuracy rate and a preset accuracy rate threshold value, an adjustment range of a working state of the robot; and a first adjusting unit, configured to The determined adjustment range adjusts the motion of the robot so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • a robot apparatus including:
  • At least one processor At least one processor
  • the memory stores a program of instructions executable by the at least one processor, the program of instructions being executed by the at least one processor to cause the at least one processor to:
  • the working state of the robot After detecting the user's voice, and when the current voice command recognition accuracy is lower than the preset accuracy threshold, the working state of the robot is adjusted until the adjusted voice command recognition accuracy of the robot reaches a preset accuracy threshold.
  • the robot device after detecting that the user issues a voice command, and the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, the robot device adjusts its working state, for example, slow-motion speed, so that the adjustment is performed. After the voice command recognition accuracy reaches the preset accuracy rate threshold, the robot device can receive the voice command more accurately without stopping the action, and better coordinate the voice command recognition accuracy and work efficiency.
  • FIG. 1 is a schematic flow chart of an embodiment of a method for recognizing a voice command of a robot according to the present invention
  • FIG. 2 is a schematic structural diagram of an embodiment of a robot voice command recognition apparatus according to the present invention.
  • Figure 3 is a schematic view showing the structure of an embodiment of the robot apparatus of the present invention.
  • Embodiments of the present invention provide a method for recognizing a voice command of a robot, a robot voice command recognition device, and a robot device.
  • FIG. 1 is a schematic flow chart of an embodiment of a method for recognizing a voice command of a robot according to the present invention. As shown in FIG. 1, the embodiment includes:
  • Step 101 After detecting that the voice is sent by the user, determining whether the current voice instruction recognition accuracy is lower than a preset accuracy threshold;
  • the execution body of this embodiment may specifically be a robot device.
  • the robot device can continuously detect whether the user emits a voice. For example, when the user's voice is captured, the user is considered to be making a voice, and then it is determined whether the current voice command recognition accuracy is lower than a preset accuracy threshold.
  • the preset accuracy rate threshold corresponds to the priority of the currently executing task of the robot; the higher the priority of executing the task, the lower the corresponding accuracy rate threshold.
  • the higher the priority of the execution task the less likely it is to be disturbed by the voice command issued by the user.
  • the task priority of a robot is divided into 1, 2, 3, 4, and 5 levels (actually can be divided into any level), with level 1 being the highest priority and level 5 being the lowest priority.
  • the priority of voice commands can be defined between any two priorities, such as 1.5, 2.5, 3.5, and 4.5, to ensure that voice commands are not at the same priority as any task.
  • the accuracy thresholds preset according to the priority of the task are x1, x2, x3, x4, and x5, respectively, corresponding to the priority of the task, 1-5, the accuracy threshold x1 is the lowest, and the accuracy threshold x5 is the highest.
  • the current voice command recognition accuracy rate may be determined according to the current noise, for example, acquiring a current noise level; and determining a current voice command recognition accuracy rate according to a current noise level according to a correspondence relationship between the locally stored noise level and the recognition accuracy rate.
  • the noise level 1 is greater than or equal to 10 dB and less than 20 dB
  • the noise level 2 is greater than or equal to 20 dB and less than 30 dB. The higher the noise level, the lower the accuracy of the voice command recognition.
  • the method may further include: identifying the expression content of the user voice command.
  • determining whether the current voice command recognition accuracy is lower than the preset accuracy rate threshold may include: determining that the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold when the expression content of the user voice command cannot be recognized.
  • Step 102 When it is determined that the current voice instruction recognition accuracy is lower than the preset accuracy rate threshold, adjust the working state of the robot until the adjusted voice command recognition accuracy of the robot reaches a preset accuracy threshold.
  • adjusting the working state of the robot may include one or any of the following: releasing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear, all of which help to reduce the operating noise of the robot itself or the object.
  • the influence of the voice command recognition accuracy rate can improve the accuracy of the current voice command recognition, and facilitate the robot device to correctly receive the voice command.
  • adjusting the working state of the robot until the adjusted accuracy of the voice command recognition of the robot reaches a preset accuracy threshold may specifically include: determining a difference between the accuracy rate and the preset accuracy threshold according to the current voice instruction.
  • the adjustment range of the working state of the robot is determined; the movement of the robot is adjusted according to the determined adjustment range, so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • the accuracy of the voice command recognition of the robot should be able to reach the preset accuracy threshold. If it still cannot be reached, it is determined that the current noise is from the external environment rather than itself.
  • the movement can inform the user that the current environmental noise is large, may cause the identification to be incorrect, or approach the user to better receive the voice command.
  • the adjusted voice command recognition accuracy rate does not reach the preset accuracy rate threshold, and the priority of the currently executed task is lower than the preset priority, the user is informed that the current environmental noise is large or close to the user.
  • the higher the priority of the execution task the less likely it is to be disturbed by the voice commands issued by the user.
  • the movement of the robot can also be adjusted step by step, that is, the working state of the robot is adjusted until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • the method includes: adjusting the action of the robot step by step according to a preset adjustment range, until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy rate threshold. For example, according to the stepwise decrement 20% adjustment, the corresponding action speed and amplitude of the vocal frequency band may be affected, and the current noise level is obtained again after each decrement, and it is determined whether the current voice command recognition accuracy is lower than a preset accuracy rate threshold, if lower than The preset accuracy threshold keeps the current motion speed and amplitude until the voice command is received.
  • the step is decremented until the robot stops moving, and the accuracy of the voice command recognition is still lower than the preset accuracy threshold, it is determined that the noise is from the external environment rather than the motion of the user, and the user may be informed that the current environmental noise is large, which may result in incorrect identification. Situation, or approaching the user to better receive voice commands.
  • Step 103 Continue to receive the voice sent by the user when determining that the current voice instruction recognition accuracy is not lower than the preset accuracy rate threshold.
  • the robot device after detecting that the user issues a voice command, and the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, the robot device adjusts its working state, for example, a slow motion speed, so that after the adjustment The voice command recognition accuracy reaches a preset accuracy rate threshold, so that the robot device can receive the voice command more accurately without stopping the action, and better coordinate the voice command recognition accuracy and work efficiency. If the adjustment speed of the robot is fast enough, the convergence can be completed quickly, and the user does not affect the task currently performed by the user. The experience is good. If the adjustment speed is slow, the convergence time may be longer, but it will still be better than no. The application of the method.
  • determining whether the current voice instruction recognition accuracy is lower than a preset accuracy threshold may not be performed after detecting that the user issues a voice instruction.
  • the robot device may determine whether the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold after starting a new task, and save the judgment result locally, and read the judgment result after detecting that the user issues a voice command.
  • FIG. 2 is a schematic structural diagram of an embodiment of a robot voice command recognition apparatus according to the present invention. As shown in FIG. 2, this embodiment includes:
  • the detecting module 201 is configured to detect a voice command sent by the user
  • the detection module can continually detect whether the user issues a voice command, for example, when the user's voice is captured, the user is considered to be making a voice command.
  • the determining module 202 is configured to determine whether the current voice instruction recognition accuracy is lower than a preset accuracy threshold
  • the preset accuracy rate threshold corresponds to the priority of the currently executing task of the robot; the higher the priority of executing the task, the lower the corresponding accuracy rate threshold.
  • the higher the priority of the execution task the less likely it is to be disturbed by the voice command issued by the user.
  • the current speech instruction recognition accuracy rate can be determined based on the current noise.
  • the robot voice instruction identification may further include: an obtaining module, configured to acquire a current noise level; and an accuracy determining module configured to determine a current voice instruction recognition accuracy rate according to a current noise level according to a correspondence between the noise level and the recognition accuracy rate.
  • the trigger determining module 202 performs an operation.
  • the adjusting module 203 is configured to: after the detecting module detects that the user issues a voice command, and when the determining module determines that the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, adjust the working state of the robot to the adjusted voice of the robot.
  • the command recognition accuracy reaches a preset accuracy threshold.
  • the correspondence between the accuracy of the voice command recognition accuracy and the adjustment range can be preset according to experience, so that the working state of the robot can be adjusted according to the difference.
  • the adjustment module 203 may include: an amplitude determining unit, configured to determine an adjustment range of an operating state of the robot according to a difference between the current voice instruction identification accuracy rate and the preset accuracy rate threshold value; and the first adjusting unit is configured to follow The determined adjustment range adjusts the motion of the robot so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • the adjustment module 203 may include: a second adjustment unit, configured to step adjust the motion of the robot according to the preset adjustment amplitude, until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  • the adjusting module 203 adjusts the working state of the robot, which may include one or any of the following: releasing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear, which These can help reduce the impact of the robot's own or the operating noise of the object on the accuracy of the voice command recognition, can improve the current voice command recognition accuracy, and facilitate the robot device to correctly receive the voice command.
  • the adjusting module adjusts its working state, for example, slow motion speed.
  • the robot device can receive the voice command more accurately without stopping the action, and the voice command recognition accuracy and work efficiency are well coordinated.
  • the determining module 202 may also perform an operation after the detecting module 201 detects that the user issues a voice command. For example, the judging module 202 may determine whether the current speech instruction recognition accuracy rate is lower than a preset accuracy rate threshold after the robot starts a new task, and save the determination result locally, and the detection module 201 detects that the user issues a voice instruction, and the robot voice instruction The identification device can directly read the judgment result.
  • the robot voice instruction recognition apparatus may not include the detection module 201 and the determination module 202, but after the user issues a voice by the external device, and when the current voice instruction recognition accuracy is lower than a preset accuracy threshold, The trigger adjustment module 203 adjusts the working state of the robot.
  • FIG. 3 is a schematic structural view of an embodiment of a robot apparatus according to the present invention.
  • the robot apparatus 300 includes:
  • At least one processor 310 exemplified by a processor 310 in FIG. 3; and a memory 320 communicatively coupled to at least one processor 310; wherein the memory stores a program of instructions executable by at least one processor, the program of instructions being at least A processor executes to enable at least one processor to perform the above-described method of robot voice instruction recognition.
  • the processor 310 and the memory 320 may be connected by a bus or other means, as exemplified by a bus connection in FIG.
  • the memory 320 is used as a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer executable program, and a module, and the method for recognizing the robot voice instruction in the embodiment of the present application corresponds to Program instructions/modules.
  • Processor 310 Performing various functional applications and data processing of the robot apparatus by executing non-volatile software programs, instructions, and modules stored in the memory 320, that is, a method for realizing robot voice instruction recognition applied to the robot apparatus of the above method embodiment .
  • the memory 320 may include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created by use of the method of the above-described robot voice instruction identification, etc. .
  • memory 320 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device.
  • memory 320 can include memory remotely located relative to processor 310, which can be connected to the robotic device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • One or more modules are stored in memory 320, and when executed by one or more processors 310, perform the method of robotic voice instruction recognition applied to the robotic device in any of the method embodiments described above.
  • the disclosed system, apparatus, and method may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • there may be another division manner for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Manipulator (AREA)

Abstract

A voice instruction recognition method for a robot, and a related robot device. The method comprises: when it is detected that a user starts speaking and the current voice instruction recognition accuracy is less than a pre-set accuracy threshold value, adjusting an operating state of a robot until the adjusted voice instruction recognition accuracy of the robot reaches the pre-set accuracy threshold value. The method can enable a robot device to receive a voice instruction more accurately without stopping operation, and better coordinates the voice instruction recognition accuracy and operating efficiency.

Description

机器人语音指令识别的方法及相关机器人装置Robot voice command recognition method and related robot device 技术领域Technical field
本发明实施方式涉及机器人领域,特别是涉及一种机器人语音指令识别的方法及相关机器人装置。Embodiments of the present invention relate to the field of robots, and in particular, to a method for recognizing a voice command of a robot and related robot devices.
背景技术Background technique
当前可运动有机械动作能力的机器人在工作时,由于舵机和/或马达的运转,或者对其他物体的操作或清洁(如开门操作、洗碗操作、扫地操作等等),都不可避免地产生噪声,而如果用户在这个过程中又对机器人发出了语音指令,噪声就必然对机器人语音识别的准确率造成一定的影响。即使有降噪算法,也只能对稳态噪声进行过滤,对机器人工作过程中变化的和不可预测的噪声很难滤除。另一种方法就是机器人在检测到用户发出语音后则停止当前的动作,等接收到语音指令后再继续之前的动作,但这样做又必然会降低工作效率。Currently, robots with mechanical motion capability are inevitably operated during operation due to the operation of the steering gear and/or the motor, or the operation or cleaning of other objects (such as door opening operation, dishwashing operation, sweeping operation, etc.). Noise is generated, and if the user issues a voice command to the robot in the process, the noise will inevitably have an impact on the accuracy of the robot voice recognition. Even with the noise reduction algorithm, only the steady-state noise can be filtered, and it is difficult to filter out the variable and unpredictable noise during the working process of the robot. Another method is that the robot stops the current action after detecting the user's voice, and waits for the voice command to continue the previous action, but this will inevitably reduce the work efficiency.
鉴于此,克服上述现有技术所存在的缺陷是本技术领域亟待解决的问题。In view of this, it is an urgent problem to be solved in the art to overcome the drawbacks of the prior art described above.
发明内容Summary of the invention
本发明实施方式主要解决的技术问题是提供一种机器人语音指令识别的方法及相关机器人装置,能够较好地协调语音指令识别准确率与工作效率。The technical problem to be solved by the embodiments of the present invention is to provide a method for recognizing a voice command of a robot and a related robot device, which can better coordinate the accuracy and working efficiency of voice command recognition.
为解决上述技术问题,本发明实施方式采用的一个技术方案是:提供一种机器人语音指令识别的方法,包括:在检测到用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。 In order to solve the above technical problem, a technical solution adopted by the embodiment of the present invention is to provide a method for recognizing a voice command of a robot, comprising: after detecting that a voice is sent by a user, and the current voice command recognition accuracy is lower than a preset When the accuracy threshold is adjusted, the working state of the robot is adjusted until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
其中,上述方法还包括:获取当前噪声等级;根据当前噪声等级,按照噪声等级与识别准确率的对应关系确定当前语音指令识别准确率。The method further includes: obtaining a current noise level; determining, according to the current noise level, a current voice instruction recognition accuracy rate according to a correspondence between the noise level and the recognition accuracy rate.
其中,预设的准确率阈值与机器人当前执行任务的优先级对应;执行任务的优先级越高,对应的准确率阈值越低。The preset accuracy threshold corresponds to the priority of the currently executing task of the robot; the higher the priority of executing the task, the lower the corresponding accuracy threshold.
其中,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值包括:根据当前语音指令识别准确率和预设的准确率阈值的差值,确定对机器人的工作状态的调整幅度;按照所确定的调整幅度调整机器人的动作,使机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The adjusting the working state of the robot until the adjusted accuracy of the adjusted voice command of the robot reaches a preset accuracy threshold includes: determining the work of the robot according to the difference between the current voice command recognition accuracy rate and the preset accuracy rate threshold value. The adjustment range of the state; the movement of the robot is adjusted according to the determined adjustment range, so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
其中,上述方法还包括:在机器人在调整后的语音指令识别准确率未达到预设的准确率阈值时,告知用户当前环境噪声较大或向用户靠近。The method further includes: when the adjusted accuracy of the voice command recognition of the robot does not reach the preset accuracy threshold, the user is notified that the current environmental noise is large or close to the user.
其中,在机器人在调整后的语音指令识别准确率未达到预设的准确率阈值时,告知用户当前环境噪声较大或向用户靠近,是指:在机器人在调整后的语音指令识别准确率未达到预设的准确率阈值且当前执行任务的优先级低于预设优先级时,告知用户当前环境噪声较大或向用户靠近。Wherein, when the accuracy of the adjusted voice command recognition of the robot does not reach the preset accuracy threshold, the user is informed that the current environmental noise is large or close to the user, which means that the accuracy of the voice command recognition after the adjustment of the robot is not When the preset accuracy threshold is reached and the priority of the currently executed task is lower than the preset priority, the user is informed that the current environmental noise is large or close to the user.
其中,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值包括:按照预设的调整幅度步进调整机器人的动作,至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。Wherein, adjusting the working state of the robot until the adjusted accuracy of the voice command recognition of the robot reaches a preset accuracy threshold includes: adjusting the motion of the robot according to the preset adjustment amplitude, until the robot corrects the adjusted voice command. The rate reaches the preset accuracy threshold.
其中,上述方法还包括:在机器人在调整工作状态后停止动作时,语音指令识别准确率仍未达到预设的准确率阈值时,告知用户当前环境噪声较大或向用户靠近。The method further includes: when the robot stops the action after adjusting the working state, when the voice instruction recognition accuracy rate has not reached the preset accuracy rate threshold, the user is notified that the current environmental noise is large or close to the user.
其中,调整机器人的工作状态包括如下一种或者任意几种:放慢动作速度、降低马达转速、关闭非人声频段舵机。Among them, adjusting the working state of the robot includes one or any of the following: placing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear.
为解决上述技术问题,本发明实施方式采用的另一个技术方案是:提供一种机器人语音指令识别装置,包括:调整模块,用于在用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,调整 机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。In order to solve the above technical problem, another technical solution adopted by the embodiment of the present invention is to provide a robot voice instruction recognition apparatus, including: an adjustment module, after the user issues a voice, and the current voice instruction recognition accuracy is lower than Adjust the preset accuracy threshold The working state of the robot until the robot adjusts the accuracy of the voice command recognition to a preset accuracy threshold.
其中,上述装置还包括:获取模块,用于获取当前噪声等级;准确率确定模块,用于根据当前噪声等级,按照噪声等级与识别准确率的对应关系确定当前语音指令识别准确率。The device further includes: an obtaining module, configured to acquire a current noise level; and an accuracy determining module configured to determine, according to a current noise level, a current voice command recognition accuracy rate according to a correspondence between the noise level and the recognition accuracy rate.
其中,上述调整模块包括:幅度确定单元,用于根据当前语音指令识别准确率和预设的准确率阈值的差值,确定对机器人的工作状态的调整幅度;第一调整单元,用于按照所确定的调整幅度调整机器人的动作,使机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The adjustment module includes: an amplitude determining unit, configured to determine, according to a difference between a current voice command recognition accuracy rate and a preset accuracy rate threshold value, an adjustment range of a working state of the robot; and a first adjusting unit, configured to The determined adjustment range adjusts the motion of the robot so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
为解决上述技术问题,本发明实施方式采用的另一个技术方案是:提供一种机器人装置,包括:In order to solve the above technical problem, another technical solution adopted by the embodiment of the present invention is to provide a robot apparatus, including:
至少一个处理器;以及At least one processor;
与至少一个处理器连接的存储器;其中,a memory coupled to at least one processor; wherein
存储器存储有可被至少一个处理器执行的指令程序,指令程序被至少一个处理器执行,以使至少一个处理器用于:The memory stores a program of instructions executable by the at least one processor, the program of instructions being executed by the at least one processor to cause the at least one processor to:
在检测到用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。After detecting the user's voice, and when the current voice command recognition accuracy is lower than the preset accuracy threshold, the working state of the robot is adjusted until the adjusted voice command recognition accuracy of the robot reaches a preset accuracy threshold.
与现有技术相比,本发明实施方式的有益效果是:Compared with the prior art, the beneficial effects of the embodiments of the present invention are:
本发明实施例中,在检测到用户发出语音指令后,且在当前语音指令识别准确率低于预设的准确率阈值时,机器人装置调整自身的工作状态,例如放慢动作速度,以使得调整后的语音指令识别准确率达到预设的准确率阈值,从而使得机器人装置无需停止动作即可更为准确地接收语音指令,较好地协调了语音指令识别准确率与工作效率。In the embodiment of the present invention, after detecting that the user issues a voice command, and the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, the robot device adjusts its working state, for example, slow-motion speed, so that the adjustment is performed. After the voice command recognition accuracy reaches the preset accuracy rate threshold, the robot device can receive the voice command more accurately without stopping the action, and better coordinate the voice command recognition accuracy and work efficiency.
附图说明DRAWINGS
图1是本发明机器人语音指令识别的方法的一个实施例的流程示意图;1 is a schematic flow chart of an embodiment of a method for recognizing a voice command of a robot according to the present invention;
图2是本发明机器人语音指令识别装置的一个实施例的结构示意图; 2 is a schematic structural diagram of an embodiment of a robot voice command recognition apparatus according to the present invention;
图3是本发明机器人装置的一个实施例的结构示意图。Figure 3 is a schematic view showing the structure of an embodiment of the robot apparatus of the present invention.
具体实施方式detailed description
本发明实施例提供了一种机器人语音指令识别的方法、机器人语音指令识别装置及机器人装置。Embodiments of the present invention provide a method for recognizing a voice command of a robot, a robot voice command recognition device, and a robot device.
请参阅图1,图1是本发明机器人语音指令识别的方法的一个实施例的流程示意图。如图1所示,本实施例包括:Please refer to FIG. 1. FIG. 1 is a schematic flow chart of an embodiment of a method for recognizing a voice command of a robot according to the present invention. As shown in FIG. 1, the embodiment includes:
步骤101:在检测到用户发出语音后,判断当前语音指令识别准确率是否低于预设的准确率阈值;Step 101: After detecting that the voice is sent by the user, determining whether the current voice instruction recognition accuracy is lower than a preset accuracy threshold;
本实施例的执行主体具体可以是机器人装置。机器人装置在执行任务过程中,可以不断地检测用户是否发出语音,例如可以在捕捉到用户声音时,认为用户正在发出语音,而后判断当前语音指令识别准确率是否低于预设的准确率阈值。The execution body of this embodiment may specifically be a robot device. During the execution of the task, the robot device can continuously detect whether the user emits a voice. For example, when the user's voice is captured, the user is considered to be making a voice, and then it is determined whether the current voice command recognition accuracy is lower than a preset accuracy threshold.
优选地,预设的准确率阈值与机器人当前执行任务的优先级对应;执行任务的优先级越高,对应的准确率阈值越低。这样,优先级越高的执行任务越不容易被用户发出的语音指令打扰。Preferably, the preset accuracy rate threshold corresponds to the priority of the currently executing task of the robot; the higher the priority of executing the task, the lower the corresponding accuracy rate threshold. Thus, the higher the priority of the execution task, the less likely it is to be disturbed by the voice command issued by the user.
例如,机器人的任务优先级分为1,2,3,4,5共五级(实际可分为任意级),1级为最高优先级,5级为最低优先级。语音指令的优先级可定义为任意两个优先级之间,如1.5,2.5,3.5,4.5,以保证语音命令与任何任务不处于相同优先级。根据任务的优先级预设的准确率阈值分别为x1,x2,x3,x4,x5,对应任务优先级的1-5级,准确率阈值x1最低,准确率阈值x5最高。For example, the task priority of a robot is divided into 1, 2, 3, 4, and 5 levels (actually can be divided into any level), with level 1 being the highest priority and level 5 being the lowest priority. The priority of voice commands can be defined between any two priorities, such as 1.5, 2.5, 3.5, and 4.5, to ensure that voice commands are not at the same priority as any task. The accuracy thresholds preset according to the priority of the task are x1, x2, x3, x4, and x5, respectively, corresponding to the priority of the task, 1-5, the accuracy threshold x1 is the lowest, and the accuracy threshold x5 is the highest.
可以根据当前噪声来确定当前语音指令识别准确率,例如:获取当前噪声等级;根据当前噪声等级,按照本地保存的噪声等级与识别准确率的对应关系确定当前语音指令识别准确率。例如,噪声等级1为大于等于10dB且小于20dB,噪声等级2为大于等于20dB且小于30dB,噪声等级越高,语音指令识别准确率越低。The current voice command recognition accuracy rate may be determined according to the current noise, for example, acquiring a current noise level; and determining a current voice command recognition accuracy rate according to a current noise level according to a correspondence relationship between the locally stored noise level and the recognition accuracy rate. For example, the noise level 1 is greater than or equal to 10 dB and less than 20 dB, and the noise level 2 is greater than or equal to 20 dB and less than 30 dB. The higher the noise level, the lower the accuracy of the voice command recognition.
也可以在检测到用户发出语音指令后,尝试识别一次语音指令的表达内容,如果不能识别,则判定当前语音指令识别准确率低于预设的准 确率阈值。例如,在步骤101中,在检测到用户发出语音指令后,在判断当前语音指令识别准确率是否低于预设的准确率阈值之前还可以包括:识别用户语音指令的表达内容。此时,判断当前语音指令识别准确率是否低于预设的准确率阈值具体可以包括:在无法识别用户语音指令的表达内容时,判定当前语音指令识别准确率低于预设的准确率阈值。It is also possible to try to recognize the expression content of the voice instruction after detecting the user issuing the voice instruction, and if not, determine that the current voice instruction recognition accuracy is lower than the preset criterion. The threshold is determined. For example, in step 101, after detecting that the voice command is sent by the user, before determining whether the current voice command recognition accuracy is lower than the preset accuracy rate threshold, the method may further include: identifying the expression content of the user voice command. At this time, determining whether the current voice command recognition accuracy is lower than the preset accuracy rate threshold may include: determining that the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold when the expression content of the user voice command cannot be recognized.
步骤102:在判定当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。Step 102: When it is determined that the current voice instruction recognition accuracy is lower than the preset accuracy rate threshold, adjust the working state of the robot until the adjusted voice command recognition accuracy of the robot reaches a preset accuracy threshold.
具体地,调整机器人的工作状态可以包括如下一种或者任意几种:放慢动作速度、降低马达转速、关闭非人声频段舵机,这些都有助于降低机器人自身或对物体的操作噪声对语音指令识别准确率的影响,可以提高当前语音指令识别准确率,利于机器人装置正确接收语音指令。Specifically, adjusting the working state of the robot may include one or any of the following: releasing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear, all of which help to reduce the operating noise of the robot itself or the object. The influence of the voice command recognition accuracy rate can improve the accuracy of the current voice command recognition, and facilitate the robot device to correctly receive the voice command.
可以根据经验预先设定语音指令识别准确率差值与调整幅度的对应关系,从而可以根据该差值来对机器人的工作状态进行调整。因此,步骤102中,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值具体可以包括:根据当前语音指令识别准确率和预设的准确率阈值的差值,确定对机器人的工作状态的调整幅度;按照所确定的调整幅度调整机器人的动作,使机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The correspondence between the accuracy of the voice command recognition accuracy and the adjustment range can be preset according to experience, so that the working state of the robot can be adjusted according to the difference. Therefore, in step 102, adjusting the working state of the robot until the adjusted accuracy of the voice command recognition of the robot reaches a preset accuracy threshold may specifically include: determining a difference between the accuracy rate and the preset accuracy threshold according to the current voice instruction. The adjustment range of the working state of the robot is determined; the movement of the robot is adjusted according to the determined adjustment range, so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
正常情况下,根据上述差值来对机器人的工作状态进行调整后,机器人的语音指令识别准确率应该能够达到预设的准确率阈值,如果仍然不能达到,则判定当前噪声来自外界环境而非自身运动,可以告知用户当前环境噪声较大,可能造成识别不正确的情况,或者向用户靠近以更好地接收语音指令。优选地,在调整后的语音指令识别准确率未达到预设的准确率阈值的情况下,且当前执行任务的优先级低于预设优先级时,告知用户当前环境噪声较大或向用户靠近,这样,优先级越高的执行任务越不容易被用户发出的语音指令打扰。Under normal circumstances, after adjusting the working state of the robot according to the above difference, the accuracy of the voice command recognition of the robot should be able to reach the preset accuracy threshold. If it still cannot be reached, it is determined that the current noise is from the external environment rather than itself. The movement can inform the user that the current environmental noise is large, may cause the identification to be incorrect, or approach the user to better receive the voice command. Preferably, when the adjusted voice command recognition accuracy rate does not reach the preset accuracy rate threshold, and the priority of the currently executed task is lower than the preset priority, the user is informed that the current environmental noise is large or close to the user. Thus, the higher the priority of the execution task, the less likely it is to be disturbed by the voice commands issued by the user.
当然,也可以步进式地调整机器人的动作,即调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值可 以包括:按照预设的调整幅度步进调整机器人的动作,至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。例如,按照阶梯递减20%调整可影响人声频段的相应动作速度和幅度,每次递减完再次获取当前噪声等级,判断当前语音指令识别准确率是否低于预设的准确率阈值,如果低于预设的准确率阈值,则保持当前动作速度和幅度等,直到语音指令接收完成。Of course, the movement of the robot can also be adjusted step by step, that is, the working state of the robot is adjusted until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold. The method includes: adjusting the action of the robot step by step according to a preset adjustment range, until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy rate threshold. For example, according to the stepwise decrement 20% adjustment, the corresponding action speed and amplitude of the vocal frequency band may be affected, and the current noise level is obtained again after each decrement, and it is determined whether the current voice command recognition accuracy is lower than a preset accuracy rate threshold, if lower than The preset accuracy threshold keeps the current motion speed and amplitude until the voice command is received.
如果阶梯递减直到机器人停止动作,语音指令识别准确率仍然低于预设的准确率阈值,则判定此噪声来自外界环境而非自身运动,可以告知用户当前环境噪声较大,可能造成识别不正确的情况,或者向用户靠近以更好地接收语音指令。If the step is decremented until the robot stops moving, and the accuracy of the voice command recognition is still lower than the preset accuracy threshold, it is determined that the noise is from the external environment rather than the motion of the user, and the user may be informed that the current environmental noise is large, which may result in incorrect identification. Situation, or approaching the user to better receive voice commands.
步骤103:在判定当前语音指令识别准确率不低于预设的准确率阈值时,继续接收用户发出的语音。Step 103: Continue to receive the voice sent by the user when determining that the current voice instruction recognition accuracy is not lower than the preset accuracy rate threshold.
本实施例中,在检测到用户发出语音指令后,且在当前语音指令识别准确率低于预设的准确率阈值时,机器人装置调整自身的工作状态,例如放慢动作速度,以使得调整后的语音指令识别准确率达到预设的准确率阈值,从而使得机器人装置无需停止动作即可更为准确地接收语音指令,较好地协调了语音指令识别准确率与工作效率。如果机器人的调整速度足够快,则能够很快的完成收敛,基本不影响用户当前执行的任务,体验很好;而如果调整速度较慢,则可能收敛时间较长,但依然会好于没有此方法应用的情况。In this embodiment, after detecting that the user issues a voice command, and the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, the robot device adjusts its working state, for example, a slow motion speed, so that after the adjustment The voice command recognition accuracy reaches a preset accuracy rate threshold, so that the robot device can receive the voice command more accurately without stopping the action, and better coordinate the voice command recognition accuracy and work efficiency. If the adjustment speed of the robot is fast enough, the convergence can be completed quickly, and the user does not affect the task currently performed by the user. The experience is good. If the adjustment speed is slow, the convergence time may be longer, but it will still be better than no. The application of the method.
需要说明的是,在一些实施例中,判断当前语音指令识别准确率是否低于预设的准确率阈值可以不在检测到用户发出语音指令后进行。例如,机器人装置可以在开始新任务后判断当前语音指令识别准确率是否低于预设的准确率阈值,并在本地保存判断结果,在检测到用户发出语音指令后读取该判断结果即可。It should be noted that, in some embodiments, determining whether the current voice instruction recognition accuracy is lower than a preset accuracy threshold may not be performed after detecting that the user issues a voice instruction. For example, the robot device may determine whether the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold after starting a new task, and save the judgment result locally, and read the judgment result after detecting that the user issues a voice command.
实施例二Embodiment 2
请参阅图2,图2是本发明机器人语音指令识别装置的一个实施例的结构示意图。如图2所示,本实施例包括: Please refer to FIG. 2. FIG. 2 is a schematic structural diagram of an embodiment of a robot voice command recognition apparatus according to the present invention. As shown in FIG. 2, this embodiment includes:
检测模块201,用于检测用户发出的语音指令;The detecting module 201 is configured to detect a voice command sent by the user;
检测模块可以不断地检测用户是否发出语音指令,例如可以在捕捉到用户声音时,认为用户正在发出语音指令。The detection module can continually detect whether the user issues a voice command, for example, when the user's voice is captured, the user is considered to be making a voice command.
判断模块202,用于判断当前语音指令识别准确率是否低于预设的准确率阈值;The determining module 202 is configured to determine whether the current voice instruction recognition accuracy is lower than a preset accuracy threshold;
优选地,预设的准确率阈值与机器人当前执行任务的优先级对应;执行任务的优先级越高,对应的准确率阈值越低。这样,优先级越高的执行任务越不容易被用户发出的语音指令打扰。Preferably, the preset accuracy rate threshold corresponds to the priority of the currently executing task of the robot; the higher the priority of executing the task, the lower the corresponding accuracy rate threshold. Thus, the higher the priority of the execution task, the less likely it is to be disturbed by the voice command issued by the user.
可以根据当前噪声来确定当前语音指令识别准确率。例如,机器人语音指令识别还可以包括:获取模块,用于获取当前噪声等级;准确率确定模块,用于根据当前噪声等级,按照噪声等级与识别准确率的对应关系确定当前语音指令识别准确率。The current speech instruction recognition accuracy rate can be determined based on the current noise. For example, the robot voice instruction identification may further include: an obtaining module, configured to acquire a current noise level; and an accuracy determining module configured to determine a current voice instruction recognition accuracy rate according to a current noise level according to a correspondence between the noise level and the recognition accuracy rate.
本实施例中,检测模块201检测到用户发出语音指令之后,触发判断模块202执行操作。In this embodiment, after the detecting module 201 detects that the user issues a voice instruction, the trigger determining module 202 performs an operation.
调整模块203,用于在检测模块检测到用户发出语音指令后,且在判断模块判定当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The adjusting module 203 is configured to: after the detecting module detects that the user issues a voice command, and when the determining module determines that the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, adjust the working state of the robot to the adjusted voice of the robot. The command recognition accuracy reaches a preset accuracy threshold.
可以根据经验预先设定语音指令识别准确率差值与调整幅度的对应关系,从而可以根据该差值来对机器人的工作状态进行调整。例如,调整模块203可以包括:幅度确定单元,用于根据当前语音指令识别准确率和预设的准确率阈值的差值,确定对机器人的工作状态的调整幅度;第一调整单元,用于按照所确定的调整幅度调整机器人的动作,使机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The correspondence between the accuracy of the voice command recognition accuracy and the adjustment range can be preset according to experience, so that the working state of the robot can be adjusted according to the difference. For example, the adjustment module 203 may include: an amplitude determining unit, configured to determine an adjustment range of an operating state of the robot according to a difference between the current voice instruction identification accuracy rate and the preset accuracy rate threshold value; and the first adjusting unit is configured to follow The determined adjustment range adjusts the motion of the robot so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
也可以步进式地调整机器人的动作。例如,调整模块203可以包括:第二调整单元,用于按照预设的调整幅度步进调整机器人的动作,至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。It is also possible to adjust the movement of the robot step by step. For example, the adjustment module 203 may include: a second adjustment unit, configured to step adjust the motion of the robot according to the preset adjustment amplitude, until the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
具体地,调整模块203调整机器人的工作状态可以包括如下一种或者任意几种:放慢动作速度、降低马达转速、关闭非人声频段舵机,这 些都有助于降低机器人自身或对物体的操作噪声对语音指令识别准确率的影响,可以提高当前语音指令识别准确率,利于机器人装置正确接收语音指令。Specifically, the adjusting module 203 adjusts the working state of the robot, which may include one or any of the following: releasing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear, which These can help reduce the impact of the robot's own or the operating noise of the object on the accuracy of the voice command recognition, can improve the current voice command recognition accuracy, and facilitate the robot device to correctly receive the voice command.
本实施例中,在检测模块检测到用户发出语音指令后,且在判断模块判定当前语音指令识别准确率低于预设的准确率阈值时,调整模块调整自身的工作状态,例如放慢动作速度,以使得调整后的语音指令识别准确率达到预设的准确率阈值,从而使得机器人装置无需停止动作即可更为准确地接收语音指令,很好地协调了语音指令识别准确率与工作效率。In this embodiment, after the detecting module detects that the user issues a voice command, and when the determining module determines that the current voice command recognition accuracy rate is lower than a preset accuracy rate threshold, the adjusting module adjusts its working state, for example, slow motion speed. In order to make the adjusted speech command recognition accuracy reach the preset accuracy rate threshold, the robot device can receive the voice command more accurately without stopping the action, and the voice command recognition accuracy and work efficiency are well coordinated.
在一些实施例中,判断模块202也可以不在检测模块201检测到用户发出语音指令后执行操作。例如,判断模块202可以在机器人开始新任务后判断当前语音指令识别准确率是否低于预设的准确率阈值,并在本地保存判断结果,检测模块201检测到用户发出语音指令后,机器人语音指令识别装置直接读取该判断结果即可。In some embodiments, the determining module 202 may also perform an operation after the detecting module 201 detects that the user issues a voice command. For example, the judging module 202 may determine whether the current speech instruction recognition accuracy rate is lower than a preset accuracy rate threshold after the robot starts a new task, and save the determination result locally, and the detection module 201 detects that the user issues a voice instruction, and the robot voice instruction The identification device can directly read the judgment result.
在一些实施例中,机器人语音指令识别装置也可以不包括检测模块201与判断模块202,而由外部设备在用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,触发调整模块203调整机器人的工作状态。In some embodiments, the robot voice instruction recognition apparatus may not include the detection module 201 and the determination module 202, but after the user issues a voice by the external device, and when the current voice instruction recognition accuracy is lower than a preset accuracy threshold, The trigger adjustment module 203 adjusts the working state of the robot.
实施例三Embodiment 3
请参阅图3,图3是本发明机器人装置的一个实施例的结构示意图。如图3所示,机器人装置300包括:Please refer to FIG. 3. FIG. 3 is a schematic structural view of an embodiment of a robot apparatus according to the present invention. As shown in FIG. 3, the robot apparatus 300 includes:
至少一个处理器310,图3中以一个处理器310为例;以及与至少一个处理器310通信连接的存储器320;其中,存储器存储有可被至少一个处理器执行的指令程序,指令程序被至少一个处理器执行,以使至少一个处理器能够执行上述机器人语音指令识别的方法。At least one processor 310, exemplified by a processor 310 in FIG. 3; and a memory 320 communicatively coupled to at least one processor 310; wherein the memory stores a program of instructions executable by at least one processor, the program of instructions being at least A processor executes to enable at least one processor to perform the above-described method of robot voice instruction recognition.
处理器310和存储器320可以通过总线或者其他方式连接,图3中以通过总线连接为例。The processor 310 and the memory 320 may be connected by a bus or other means, as exemplified by a bus connection in FIG.
存储器320作为一种非易失性计算机可读存储介质,可用于存储非易失性软件程序、非易失性计算机可执行程序以及模块,如本申请实施例中的机器人语音指令识别的方法对应的程序指令/模块。处理器310 通过运行存储在存储器320中的非易失性软件程序、指令以及模块,从而执行机器人装置的各种功能应用以及数据处理,即实现上述方法实施例的应用于机器人装置的机器人语音指令识别的方法。The memory 320 is used as a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer executable program, and a module, and the method for recognizing the robot voice instruction in the embodiment of the present application corresponds to Program instructions/modules. Processor 310 Performing various functional applications and data processing of the robot apparatus by executing non-volatile software programs, instructions, and modules stored in the memory 320, that is, a method for realizing robot voice instruction recognition applied to the robot apparatus of the above method embodiment .
存储器320可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储上述机器人语音指令识别的方法的使用所创建的数据等。此外,存储器320可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实施例中,存储器320可包括相对于处理器310远程设置的存储器,这些远程存储器可以通过网络连接至机器人装置。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 320 may include a storage program area and an storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created by use of the method of the above-described robot voice instruction identification, etc. . Moreover, memory 320 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 320 can include memory remotely located relative to processor 310, which can be connected to the robotic device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
一个或者多个模块存储在存储器320中,当被一个或者多个处理器310执行时,执行上述任意方法实施例中的应用于机器人装置的机器人语音指令识别的方法。One or more modules are stored in memory 320, and when executed by one or more processors 310, perform the method of robotic voice instruction recognition applied to the robotic device in any of the method embodiments described above.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考上述方法实施例中的对应过程,在此不再赘述。A person skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the system, the device and the unit described above can refer to the corresponding process in the foregoing method embodiment, and details are not described herein again.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present application, it should be understood that the disclosed system, apparatus, and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the unit is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored or not executed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
以上所述仅为本发明的实施方式,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。 The above is only the embodiment of the present invention, and is not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the invention and the drawings are directly or indirectly applied to other related technologies. The fields are all included in the scope of patent protection of the present invention.

Claims (15)

  1. 一种机器人语音指令识别的方法,其特征在于,包括:A method for recognizing a robot voice instruction, comprising:
    在检测到用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。After detecting the user's voice, and when the current voice command recognition accuracy is lower than the preset accuracy threshold, the working state of the robot is adjusted until the adjusted voice command recognition accuracy of the robot reaches a preset accuracy threshold.
  2. 根据权利要求1所述的方法,其特征在于,该方法还包括:The method of claim 1 further comprising:
    获取当前噪声等级;Get the current noise level;
    根据所述当前噪声等级,按照噪声等级与识别准确率的对应关系确定当前语音指令识别准确率。And determining, according to the current noise level, a current voice instruction recognition accuracy rate according to a correspondence relationship between the noise level and the recognition accuracy rate.
  3. 根据权利要求1所述的方法,其特征在于,所述预设的准确率阈值与所述机器人当前执行任务的优先级对应;执行任务的优先级越高,对应的准确率阈值越低。The method according to claim 1, wherein the preset accuracy rate threshold corresponds to a priority of the currently executing task of the robot; and the higher the priority of executing the task, the lower the corresponding accuracy rate threshold.
  4. 根据权利要求1所述的方法,其特征在于,所述调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值包括:The method according to claim 1, wherein the adjusting the working state of the robot until the adjusted accuracy of the adjusted voice command of the robot reaches a preset accuracy threshold comprises:
    根据所述当前语音指令识别准确率和所述预设的准确率阈值的差值,确定对机器人的工作状态的调整幅度;Determining an adjustment range of the working state of the robot according to the difference between the current voice instruction identification accuracy rate and the preset accuracy rate threshold value;
    按照所确定的调整幅度调整机器人的动作,使机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The action of the robot is adjusted according to the determined adjustment range, so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  5. 根据权利要求4所述的方法,其特征在于,所述方法还包括:The method of claim 4, wherein the method further comprises:
    在机器人在调整后的语音指令识别准确率未达到预设的准确率阈值时,告知用户当前环境噪声较大或向用户靠近。When the adjusted accuracy of the voice command recognition of the robot does not reach the preset accuracy threshold, the user is informed that the current environmental noise is large or close to the user.
  6. 根据权利要求5所述的方法,其特征在于,所述在机器人在调整后的语音指令识别准确率未达到预设的准确率阈值时,告知用户当前环境噪声较大或向用户靠近,是指:The method according to claim 5, wherein the robot informs the user that the current environmental noise is large or close to the user when the adjusted voice command recognition accuracy does not reach the preset accuracy threshold, :
    在机器人在调整后的语音指令识别准确率未达到预设的准确率阈值且当前执行任务的优先级低于预设优先级时,告知用户当前环境噪声较大或向用户靠近。 When the accuracy of the adjusted voice command recognition of the robot does not reach the preset accuracy threshold and the priority of the currently executed task is lower than the preset priority, the user is informed that the current environmental noise is large or close to the user.
  7. 根据权利要求1所述的方法,其特征在于,所述调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值包括:The method according to claim 1, wherein the adjusting the working state of the robot until the adjusted accuracy of the adjusted voice command of the robot reaches a preset accuracy threshold comprises:
    按照预设的调整幅度步进调整机器人的动作,至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The robot's motion is adjusted step by step according to the preset adjustment range, until the robot's adjusted voice command recognition accuracy reaches a preset accuracy threshold.
  8. 根据权利要求7所述的方法,其特征在于,所述方法还包括:The method of claim 7, wherein the method further comprises:
    在机器人在调整工作状态后停止动作时,语音指令识别准确率仍未达到预设的准确率阈值时,告知用户当前环境噪声较大或向用户靠近。When the robot stops moving after adjusting the working state, when the voice command recognition accuracy rate has not reached the preset accuracy rate threshold, the user is informed that the current environmental noise is large or close to the user.
  9. 根据权利要求1-8中任一项所述的方法,其特征在于,所述调整机器人的工作状态包括如下一种或者任意几种:放慢动作速度、降低马达转速、关闭非人声频段舵机。The method according to any one of claims 1-8, wherein the operating state of the adjusting robot comprises one or any of the following: releasing a slow motion speed, reducing a motor speed, and turning off a non-human sound band rudder. machine.
  10. 一种机器人语音指令识别装置,其特征在于,包括:A robot voice instruction recognition device, comprising:
    调整模块,用于在用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The adjustment module is configured to adjust the working state of the robot to the preset accuracy of the adjusted voice command after the voice is sent by the user and the current voice command recognition accuracy is lower than the preset accuracy threshold. Rate threshold.
  11. 根据权利要求10所述的装置,其特征在于,所述装置还包括:The device according to claim 10, wherein the device further comprises:
    获取模块,用于获取当前噪声等级;An acquisition module for obtaining a current noise level;
    准确率确定模块,用于根据所述当前噪声等级,按照噪声等级与识别准确率的对应关系确定当前语音指令识别准确率。The accuracy determination module is configured to determine, according to the current noise level, a current voice instruction recognition accuracy rate according to a correspondence between the noise level and the recognition accuracy rate.
  12. 根据权利要求10所述的装置,其特征在于,所述预设的准确率阈值与所述机器人当前执行任务的优先级对应;执行任务的优先级越高,对应的准确率阈值越低。The apparatus according to claim 10, wherein the preset accuracy rate threshold corresponds to a priority of the currently executing task of the robot; and the higher the priority of executing the task, the lower the corresponding accuracy rate threshold.
  13. 根据权利要求10所述的装置,其特征在于,所述调整模块包括:The apparatus according to claim 10, wherein the adjustment module comprises:
    幅度确定单元,用于根据所述当前语音指令识别准确率和所述预设的准确率阈值的差值,确定对机器人的工作状态的调整幅度;And an amplitude determining unit, configured to determine, according to the difference between the current voice instruction identification accuracy rate and the preset accuracy rate threshold, an adjustment range of the working state of the robot;
    第一调整单元,用于按照所确定的调整幅度调整机器人的动作,使机器人在调整后的语音指令识别准确率达到预设的准确率阈值。The first adjusting unit is configured to adjust the motion of the robot according to the determined adjustment range, so that the accuracy of the adjusted voice command recognition of the robot reaches a preset accuracy threshold.
  14. 根据权利要求10-13中任一项所述的装置,其特征在于,所述 调整机器人的工作状态包括如下一种或者任意几种:放慢动作速度、降低马达转速、关闭非人声频段舵机。Apparatus according to any one of claims 10-13, wherein said said Adjusting the working state of the robot includes one or any of the following: placing the slow motion speed, reducing the motor speed, and turning off the non-human voice frequency steering gear.
  15. 一种机器人装置,其特征在于,包括:A robot apparatus, comprising:
    至少一个处理器;以及At least one processor;
    与所述至少一个处理器连接的存储器;其中,a memory coupled to the at least one processor; wherein
    所述存储器存储有可被所述至少一个处理器执行的指令程序,所述指令程序被所述至少一个处理器执行,以使所述至少一个处理器用于:The memory stores an instruction program executable by the at least one processor, the instruction program being executed by the at least one processor to cause the at least one processor to:
    在检测到用户发出语音后,且在当前语音指令识别准确率低于预设的准确率阈值时,调整机器人的工作状态至机器人在调整后的语音指令识别准确率达到预设的准确率阈值。 After detecting the user's voice, and when the current voice command recognition accuracy is lower than the preset accuracy threshold, the working state of the robot is adjusted until the adjusted voice command recognition accuracy of the robot reaches a preset accuracy threshold.
PCT/CN2016/106118 2016-11-16 2016-11-16 Voice instruction recognition method for robot, and related robot device WO2018090252A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201680002660.0A CN106796790B (en) 2016-11-16 2016-11-16 Robot voice instruction recognition method and related robot device
PCT/CN2016/106118 WO2018090252A1 (en) 2016-11-16 2016-11-16 Voice instruction recognition method for robot, and related robot device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/106118 WO2018090252A1 (en) 2016-11-16 2016-11-16 Voice instruction recognition method for robot, and related robot device

Publications (1)

Publication Number Publication Date
WO2018090252A1 true WO2018090252A1 (en) 2018-05-24

Family

ID=58952335

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/106118 WO2018090252A1 (en) 2016-11-16 2016-11-16 Voice instruction recognition method for robot, and related robot device

Country Status (2)

Country Link
CN (1) CN106796790B (en)
WO (1) WO2018090252A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115171284A (en) * 2022-07-01 2022-10-11 国网汇通金财(北京)信息科技有限公司 Old people care method and device

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107820619B (en) * 2017-09-21 2019-12-10 达闼科技(北京)有限公司 hierarchical interaction decision-making method, interaction terminal and cloud server
CN111968643A (en) * 2017-09-29 2020-11-20 赵成智 Intelligent recognition method, robot and computer readable storage medium
CN109994111B (en) * 2019-02-26 2021-11-23 维沃移动通信有限公司 Interaction method, interaction device and mobile terminal
CN110288991B (en) * 2019-06-18 2022-02-18 北京梧桐车联科技有限责任公司 Voice recognition method and device
CN114536363B (en) * 2022-02-25 2024-06-04 杭州萤石软件有限公司 Robot control method, robot system and robot

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1397063A (en) * 2000-11-27 2003-02-12 皇家菲利浦电子有限公司 Method for control of unit comprising acoustic output device
CN104064185A (en) * 2013-03-18 2014-09-24 联想(北京)有限公司 Information processing method and system and electronic device
CN104078040A (en) * 2014-06-26 2014-10-01 美的集团股份有限公司 Voice recognition method and system

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3493849B2 (en) * 1995-11-20 2004-02-03 ソニー株式会社 Voice recognition device
US20080147411A1 (en) * 2006-12-19 2008-06-19 International Business Machines Corporation Adaptation of a speech processing system from external input that is not directly related to sounds in an operational acoustic environment
JP5075664B2 (en) * 2008-02-15 2012-11-21 株式会社東芝 Spoken dialogue apparatus and support method
US9293130B2 (en) * 2008-05-02 2016-03-22 Nuance Communications, Inc. Method and system for robust pattern matching in continuous speech for spotting a keyword of interest using orthogonal matching pursuit
KR101622604B1 (en) * 2009-05-19 2016-05-31 엘지전자 주식회사 Mobile terminal and method for processing process thereof
US8311820B2 (en) * 2010-01-28 2012-11-13 Hewlett-Packard Development Company, L.P. Speech recognition based on noise level
US8438023B1 (en) * 2011-09-30 2013-05-07 Google Inc. Warning a user when voice input to a device is likely to fail because of background or other noise
CN102831894B (en) * 2012-08-09 2014-07-09 华为终端有限公司 Command processing method, command processing device and command processing system
WO2014068788A1 (en) * 2012-11-05 2014-05-08 三菱電機株式会社 Speech recognition device
CN103971680B (en) * 2013-01-24 2018-06-05 华为终端(东莞)有限公司 A kind of method, apparatus of speech recognition
CN103065631B (en) * 2013-01-24 2015-07-29 华为终端有限公司 A kind of method of speech recognition, device
GB2516075B (en) * 2013-07-10 2018-08-22 Cirrus Logic Int Semiconductor Ltd Sensor input recognition
CN103916875B (en) * 2014-04-24 2018-04-06 山东大学 Management and planning system based on WIFI wireless networks multiclass control terminal
CN103928026B (en) * 2014-05-12 2017-04-12 安徽江淮汽车集团股份有限公司 Automobile voice command acquiring and processing system and method
CN104505092B (en) * 2014-12-10 2017-12-22 广东美的制冷设备有限公司 Air conditioner sound control method and system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1397063A (en) * 2000-11-27 2003-02-12 皇家菲利浦电子有限公司 Method for control of unit comprising acoustic output device
CN104064185A (en) * 2013-03-18 2014-09-24 联想(北京)有限公司 Information processing method and system and electronic device
CN104078040A (en) * 2014-06-26 2014-10-01 美的集团股份有限公司 Voice recognition method and system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115171284A (en) * 2022-07-01 2022-10-11 国网汇通金财(北京)信息科技有限公司 Old people care method and device
CN115171284B (en) * 2022-07-01 2023-12-26 国网汇通金财(北京)信息科技有限公司 Senior caring method and device

Also Published As

Publication number Publication date
CN106796790A (en) 2017-05-31
CN106796790B (en) 2020-11-10

Similar Documents

Publication Publication Date Title
WO2018090252A1 (en) Voice instruction recognition method for robot, and related robot device
CN108231079B (en) Method, apparatus, device and computer-readable storage medium for controlling electronic device
US9977954B2 (en) Robot cleaner and method for controlling a robot cleaner
US20210365030A1 (en) Obstacle-avoidance moving method of self-moving robot
US10453457B2 (en) Method for performing voice control on device with microphone array, and device thereof
WO2019071989A1 (en) Smart device speech enhancement method and device and smart device
CN105116994A (en) Intelligent robot tracking method and tracking device based on artificial intelligence
US9477217B2 (en) Using visual cues to improve appliance audio recognition
WO2018049782A1 (en) Household appliance control method, device and system, and intelligent air conditioner
JP2018005881A (en) Face detection tracking method, robot head pivoting control method, and robot
JP2019532543A (en) Control system and control processing method and apparatus
CN106548231B (en) Mobile control device, mobile robot and method for moving to optimal interaction point
RU2013140687A (en) GESTURE MANAGED SYSTEM THAT USES PROPIOCCEPTION TO CREATE AN ABSOLUTE COORDINATE SYSTEM
RU2015137291A (en) METHOD AND DEVICE FOR MANAGING INTELLECTUAL HOUSING DEVICE
US10793993B2 (en) Device and method for controlling automatic opening and closing of upper cover of washing machine
TW201941104A (en) Control method for smart device, apparatus, device, and storage medium
CN105260022B (en) A kind of method and device based on gesture control sectional drawing
US20130308835A1 (en) Mobile Communication Device with Image Recognition and Method of Operation Therefor
CN108375911B (en) Equipment control method and device, storage medium and equipment
CN104683847A (en) Television capable of controlling volume intelligently and method
CN104375627A (en) Method of adjusting volume and device thereof
WO2017148006A1 (en) Terminal control method and device, terminal, and computer storage medium
KR20190001067A (en) Method and apparatus for speech recognition
WO2018068422A1 (en) Method and system for improving kitchen ventilator voice recognition rate
US11426874B2 (en) Robot-based insertion mounting of workpieces

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16921500

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO FORM 1205A DATED 14.10.2019)

122 Ep: pct application non-entry in european phase

Ref document number: 16921500

Country of ref document: EP

Kind code of ref document: A1