WO2017000786A1 - System and method for training robot via voice - Google Patents

System and method for training robot via voice Download PDF

Info

Publication number
WO2017000786A1
WO2017000786A1 PCT/CN2016/085911 CN2016085911W WO2017000786A1 WO 2017000786 A1 WO2017000786 A1 WO 2017000786A1 CN 2016085911 W CN2016085911 W CN 2016085911W WO 2017000786 A1 WO2017000786 A1 WO 2017000786A1
Authority
WO
WIPO (PCT)
Prior art keywords
statement
preset
conditional statement
robot
training
Prior art date
Application number
PCT/CN2016/085911
Other languages
French (fr)
Chinese (zh)
Inventor
蔡明峻
Original Assignee
芋头科技(杭州)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 芋头科技(杭州)有限公司 filed Critical 芋头科技(杭州)有限公司
Publication of WO2017000786A1 publication Critical patent/WO2017000786A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Manipulator (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Toys (AREA)

Abstract

Disclosed are a system and method for training a robot via voice. The system for training a robot via voice comprises: a receiver unit used for receiving a voice signal; a parsing unit connected to the receiver unit and used for parsing the voice signal, matching the voice signal with a preset statement, and acquiring a conditional statement matching the preset statement and corresponding to the voice signal and an execution statement corresponding to the voice signal; a processing unit connected to the parsing unit and used for combining the conditional statement with the execution statement to produce a target entry; and a storage unit connected to the processing unit and used for storing a preset entry and training a robot on the basis of the preset entry. The processing unit performs a weighted calculation on the basis of the target entry and processes correspondingly on the basis of the result of the weighted calculation.

Description

一种通过语音对机器人进行训练的系统及方法System and method for training robot by voice 技术领域Technical field
本发明涉及机器人领域,尤其涉及一种通过语音对机器人进行训练的系统及方法。The present invention relates to the field of robots, and more particularly to a system and method for training a robot by voice.
背景技术Background technique
目前对机器人行为进行训练的方法仅限于使用编程开发的方式来对机器人的逻辑进行修改,开发者通过修改机器人的程序逻辑,完成在满足某项条件下执行某种动作的设定。这种训练方式对于机器人底层开发是必须的,但进入上层逻辑开发时,则出现开发效率低,错误率高等缺陷;这种训练方式不适用于不具备编程开发专业技能的普通用户,如果普通用户想对机器人的行为做少许修改,则需要耗费大量的时间进行学习。At present, the method of training robot behavior is limited to the use of programming development to modify the logic of the robot. The developer modifies the program logic of the robot to complete the setting of performing certain actions under certain conditions. This training method is necessary for the underlying development of the robot, but when it enters the upper logic development, it has the defects of low development efficiency and high error rate; this training method is not suitable for ordinary users who do not have the professional skills of programming development, if ordinary users If you want to make a small change to the behavior of the robot, it will take a lot of time to learn.
综上所述,上述训练方法适用范围窄、效率低且错误率高。In summary, the above training method has a narrow application range, low efficiency, and high error rate.
发明内容Summary of the invention
针对现有的对机器人进行训练的方法存在的上述问题,现提供一种旨在实现支持没有编程开发基础的用户通过语音对机器人进行训练的系统及方法。In view of the above problems existing in the existing methods of training robots, there is now provided a system and method for implementing support for a robot that supports a robot without speech based on programming.
具体技术方案如下: The specific technical solutions are as follows:
一种通过语音对机器人进行训练的系统,包括:A system for training robots by voice, including:
一接收单元,用于接收语音信号;a receiving unit, configured to receive a voice signal;
一解析单元,连接所述接收单元,用于对所述语音信号进行解析,将所述语音信号与预设语句进行匹配,获取与所述预设语句匹配的且与所述语音信号对应的条件语句,及与所述语音信号对应的执行语句;An analyzing unit, configured to connect the receiving unit, to parse the voice signal, match the voice signal with a preset statement, and acquire a condition that matches the preset statement and corresponds to the voice signal a statement, and an execution statement corresponding to the voice signal;
一处理单元,连接所述解析单元,用于将所述条件语句与所述执行语句结合生成一目标条目;a processing unit, coupled to the parsing unit, configured to combine the conditional statement with the execution statement to generate a target entry;
一存储单元,连接所述处理单元,用以存储预设条目,根据所述预设条目对机器人进行训练;a storage unit, connected to the processing unit, for storing a preset item, and training the robot according to the preset item;
所述处理单元根据所述目标条目进行权重计算,并根据所述权重计算结果进行相应的处理。The processing unit performs weight calculation according to the target item, and performs corresponding processing according to the weight calculation result.
优选的,所述解析单元包括:Preferably, the parsing unit comprises:
一第一转换模块,用于将所述语音信号转换为文字信息;a first conversion module, configured to convert the voice signal into text information;
一语义分析模块,连接所述第一转换模块,用于对所述文字信息进行解析,将所述文字信息与所述预设语句进行匹配,获取与所述预设语句匹配的且与所述文字信息对应的条件语句,并识别所述条件语句是标准式条件语句或反馈式条件语句;a semantic analysis module, configured to connect the first conversion module, to parse the text information, match the text information with the preset statement, and obtain a match with the preset statement and a conditional statement corresponding to the text information, and identifying that the conditional statement is a standard conditional statement or a feedback conditional statement;
若所述条件语句是标准式条件语句,则获取与所述文件信息对应的执行语句;If the conditional statement is a standard conditional statement, acquiring an execution statement corresponding to the file information;
若所述条件语句是反馈式条件语句,则进行权重运算,使所述机器人执行上一次任务的操作。If the conditional statement is a feedback conditional statement, a weighting operation is performed to cause the robot to perform an operation of the last task.
优选的,所述解析单元还包括:Preferably, the parsing unit further includes:
一第二转换模块,连接所述语义分析模块,用于将所述执行语句转换为相应的音频信号,并输出。A second conversion module is connected to the semantic analysis module for converting the execution statement into a corresponding audio signal and outputting.
优选的,每一条所述预设条目包括预设条件语句和预设执行语句。Preferably, each of the preset items includes a preset condition statement and a preset execution statement.
优选的,所述处理单元根据所述目标条目中的所述条件语句,遍历所述 存储单元中的所有所述预设条目中的所述预设条件语句,以获取所述条件语句是否与所述预设条件语句重复,若不重复,则进行所述权重运算,并将所述目标条目存储于所述存储单元中以形成新的所述预设条目,根据所述预设条目对机器人进行训练;若重复则进行所述权重运算,并根据所述权重计算结果进行相应的处理。Preferably, the processing unit traverses the condition according to the conditional statement in the target entry Storing the preset conditional statement in all the preset entries in the unit to obtain whether the conditional statement is repeated with the preset conditional statement, if not, performing the weighting operation, and Target items are stored in the storage unit to form a new preset item, and the robot is trained according to the preset item; if repeated, the weighting operation is performed, and corresponding processing is performed according to the weight calculation result .
一种通过语音对机器人进行训练的方法,包括下述步骤:A method of training a robot by voice, comprising the following steps:
S1.采集语音信号;S1. collecting a voice signal;
S2.对所述语音信号进行解析,将所述语音信号与预设语句进行匹配,获取与所述预设语句匹配的且与所述语音信号对应的条件语句,及与所述语音信号对应的执行语句;S2. Parsing the voice signal, matching the voice signal with a preset statement, acquiring a conditional statement matching the preset statement and corresponding to the voice signal, and corresponding to the voice signal Execute statement
S3.将所述条件语句与所述执行语句结合生成一目标条目;S3. Combining the conditional statement with the execution statement to generate a target entry;
S4.根据所述目标条目进行权重计算,并根据所述权重计算结果进行相应的处理。S4. Perform weight calculation according to the target item, and perform corresponding processing according to the weight calculation result.
优选的,所述步骤S2具体包括:Preferably, the step S2 specifically includes:
S21.将所述语音信号转换为文字信息;S21. Converting the voice signal into text information;
S22.对所述文字信息进行解析,将所述文字信息与所述预设语句进行匹配,获取与所述预设语句匹配的且与所述文字信息对应的条件语句,并识别所述条件语句是标准式条件语句或反馈式条件语句;S22. Parsing the text information, matching the text information with the preset statement, acquiring a conditional statement matching the preset statement and corresponding to the text information, and identifying the conditional statement Is a standard conditional statement or a feedback conditional statement;
若所述条件语句是标准式条件语句,则获取与所述文件信息对应的执行语句;If the conditional statement is a standard conditional statement, acquiring an execution statement corresponding to the file information;
若所述条件语句是反馈式条件语句,则进行权重运算,使所述机器人执行上一次任务的操作。If the conditional statement is a feedback conditional statement, a weighting operation is performed to cause the robot to perform an operation of the last task.
优选的,所述步骤S2还包括:Preferably, the step S2 further includes:
S23.将所述执行语句转换为相应的音频信号,并输出。S23. Convert the execution statement into a corresponding audio signal and output.
优选的,每一条所述预设条目包括预设条件语句和预设执行语句。Preferably, each of the preset items includes a preset condition statement and a preset execution statement.
优选的,所述步骤S3具体包括: Preferably, the step S3 specifically includes:
S31.根据所述目标条目中的所述条件语句,遍历所述存储单元中的所有所述预设条目中的所述预设条件语句;S31. Traverse the preset conditional statement in all the preset entries in the storage unit according to the conditional statement in the target entry;
S32.获取遍历结果,判断所述条件语句是否与所述预设条件语句重复,S32. Acquire a traversal result, and determine whether the conditional statement is repeated with the preset conditional statement.
若所述条件语句与所述预设条件语句不重复,则执行步骤S33;If the conditional statement does not overlap with the preset conditional statement, step S33 is performed;
若所述条件语句与所述预设条件语句重复,则执行步骤S34;If the conditional statement is repeated with the preset conditional statement, step S34 is performed;
S33.进行所述权重运算,并将所述目标条目存储于所述存储单元中以形成新的所述预设条目,根据所述预设条目对机器人进行训练;S33. Perform the weighting operation, and store the target entry in the storage unit to form a new preset entry, and train the robot according to the preset entry;
S34.进行所述权重运算,并根据所述权重计算结果进行相应的处理。S34. Perform the weighting operation, and perform corresponding processing according to the weighting calculation result.
上述技术方案的有益效果:The beneficial effects of the above technical solutions:
本技术方案中,在通过语音对机器人进行训练的系统中,通过解析单元对语音信号进行解析获取相应的条件语句和执行语句,通过处理单元将条件语句和执行语句结合生成条目,使机器人根据条目进行相应的训练,效率高且错误率低。在通过语音对机器人进行训练的方法中,只需用户输入语音信号即可对机器人进行训练,操作简单,适用范围广且效率高。In the technical solution, in the system for training the robot by voice, the parsing unit parses the speech signal to obtain a corresponding conditional statement and an execution statement, and the processing unit combines the conditional statement and the execution statement to generate an entry, so that the robot according to the entry The corresponding training is carried out with high efficiency and low error rate. In the method of training the robot by voice, the robot can be trained only by inputting a voice signal, and the operation is simple, and the scope of application is wide and the efficiency is high.
附图说明DRAWINGS
图1为本发明所述通过语音对机器人进行训练的系统的一种实施例的模块图;1 is a block diagram of an embodiment of a system for training a robot by voice according to the present invention;
图2为本发明所述通过语音对机器人进行训练的方法的一种实施的流程图;2 is a flow chart of an implementation of a method for training a robot by voice according to the present invention;
图3为对语音信号进行解析的方法流程图;3 is a flow chart of a method for parsing a voice signal;
图4为根据遍历结果对所述目标条目进行相应的处理的方法流程图。4 is a flow chart of a method for performing corresponding processing on the target entry according to a traversal result.
具体实施方式 detailed description
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动的前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, but not all embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
需要说明的是,在不冲突的情况下,本发明中的实施例及实施例中的特征可以相互组合。It should be noted that the embodiments in the present invention and the features in the embodiments may be combined with each other without conflict.
下面结合附图和具体实施例对本发明作进一步说明,但不作为本发明的限定。The invention is further illustrated by the following figures and specific examples, but is not to be construed as limiting.
如图1所示,一种通过语音对机器人进行训练的系统,包括:As shown in FIG. 1, a system for training a robot by voice, comprising:
一接收单元1,用于接收语音信号;a receiving unit 1 for receiving a voice signal;
一解析单元2,连接接收单元1,用于对语音信号进行解析,将语音信号与预设语句进行匹配,获取与预设语句匹配的且与语音信号对应的条件语句,及与语音信号对应的执行语句;An analyzing unit 2 is connected to the receiving unit 1 for parsing the voice signal, matching the voice signal with the preset statement, acquiring a conditional statement matching the preset sentence and corresponding to the voice signal, and corresponding to the voice signal Execute statement
一处理单元3,连接解析单元2,用于将条件语句与执行语句结合生成一目标条目;a processing unit 3, connected to the parsing unit 2, for combining a conditional statement with an execution statement to generate a target entry;
一存储单元4,连接处理单元3,用以存储预设条目,根据预设条目对机器人进行训练;a storage unit 4, connected to the processing unit 3, for storing a preset item, and training the robot according to the preset item;
处理单元3根据目标条目进行权重计算,并根据权重计算结果进行相应的处理。The processing unit 3 performs weight calculation according to the target entry, and performs corresponding processing according to the weight calculation result.
在本实施例中,采用语音对机器人进行训练的系统可应用于儿童类玩具中,虽然儿童不具备专业的编程开发技能,但儿童可以通过自然语言与机器人交流,并训练机器人执行相应的动作。In the present embodiment, the system for training robots by voice can be applied to children's toys. Although children do not have professional programming development skills, children can communicate with robots through natural language and train robots to perform corresponding actions.
在本实施例中,针对机器人行为逻辑开发的优化过程,选择了适合普通用户与机器人进行交互的方式,使用户在对机器人进行训练的过程专注于训 练逻辑本身,而非开发语言,提高了工作效率且降低了错误率。通过解析单元2对语音信号进行解析获取相应的条件语句和执行语句,通过处理单元3将条件语句和执行语句结合生成条目,使机器人根据条目进行相应的训练,效率高且错误率低。In this embodiment, for the optimization process of the robot behavior logic development, a method suitable for the interaction between the ordinary user and the robot is selected, so that the user concentrates on the training process of the robot. Practicing logic itself, rather than developing languages, increases productivity and reduces error rates. The parsing unit 2 parses the speech signal to obtain a corresponding conditional statement and an execution statement, and the processing unit 3 combines the conditional statement and the execution statement to generate an item, so that the robot performs corresponding training according to the item, and the efficiency is high and the error rate is low.
在优选的实施例中,解析单元2包括:In a preferred embodiment, the parsing unit 2 comprises:
一第一转换模块21,用于将语音信号转换为文字信息;a first conversion module 21, configured to convert the voice signal into text information;
一语义分析模块22,连接第一转换模块21,用于对文字信息进行解析,将文字信息与预设语句进行匹配,获取与预设语句匹配的且与文字信息对应的条件语句,并识别条件语句是标准式条件语句或反馈式条件语句;a semantic analysis module 22 is connected to the first conversion module 21 for parsing the text information, matching the text information with the preset statement, acquiring a conditional statement matching the preset sentence and corresponding to the text information, and identifying the condition The statement is a standard conditional statement or a feedback conditional statement;
若条件语句是标准式条件语句,则获取与文件信息对应的执行语句;If the conditional statement is a standard conditional statement, obtaining an execution statement corresponding to the file information;
若条件语句是反馈式条件语句,则进行权重运算,使机器人执行上一次任务的操作。If the conditional statement is a feedback conditional statement, a weighting operation is performed to cause the robot to perform the operation of the previous task.
在本实施例中,目标条目对应的句式可以是:In this embodiment, the sentence pattern corresponding to the target item may be:
当A时,就B;When A, then B;
如果A时,则B;If A, then B;
不要再在A时,做B;Don't do B when you are at A;
这个时候应该做B;This time should be done B;
这是错误的;This is wrong;
这样做是不对的等。This is not the right thing to do.
其中,“当A时”,“如果A时”,“不要再在A时”,以及“这个时候”均为标准式条件语句;“这是错误的”和“这样做是不对的”均为反馈式条件语句。Among them, "when A", "if A", "don't be at A", and "this time" are standard conditional statements; "this is wrong" and "this is wrong" Feedback conditional statement.
采用语音对机器人进行训练的系统的整个训练过程为:当识别到训练关键句式时机器人进入训练模式,用户可使用与上述相似的句式与机器人对话时,通过解析单元2的语义分析模块22将用户说的话划分为部分A和部分B,经过语义转换,将部分A转换为条件开发语句,将部分B转换为执行动 作开发语句,把部分A和部分B的关联关系追加到本地的训练知识库(存储单元4),并将部分A和部分B结合形成新的条目,如果部分A与训练知识库中的条件开发语句相同,部分B与训练知识库中相应的执行动作开发语句不同,则部分A为两条条件一样但执行不同动作的知识条目,需进行权重运算,权重运算包含用户的正负反馈,追加时间进行考量,并将新知识条目追加到本地知识库,并更新训练知识库。当识别到普通自然语言交流时,训练模式结束,机器人结束训练回归到轮询判断模式,历遍训练知识库中的所有条目,当命中某一条知识条目时,则执行知识条目中所包含的执行动作开发语句。The whole training process of the system for training the robot by using the voice is: when the training key sentence pattern is recognized, the robot enters the training mode, and the user can use the sentence pattern similar to the above to talk to the robot, and the semantic analysis module 22 of the parsing unit 2 The words spoken by the user are divided into part A and part B. After semantic conversion, part A is converted into conditional development statement, and part B is converted into execution. As a development statement, the relationship between Part A and Part B is added to the local training knowledge base (storage unit 4), and part A and part B are combined to form a new item, if part A and conditional development in the training knowledge base The statement is the same, part B is different from the corresponding execution action development statement in the training knowledge base, then part A is the knowledge item with the same two conditions but performing different actions, and the weight operation is required. The weight operation includes the user's positive and negative feedback, and the additional time Consider and append new knowledge items to the local knowledge base and update the training knowledge base. When the normal natural language communication is recognized, the training mode ends, the robot ends the training and returns to the polling judgment mode, and all the entries in the knowledge base are trained, and when a certain knowledge item is hit, the execution included in the knowledge item is executed. Action development statement.
在本实施例中,第一转换模块21可采用自动语音识别(Automatic Speech Recognition,ASR)技术,ASR技术可将人类语音中的词汇内容转换为计算机可读的内容并输入计算机,并且与计算机进行交互。In this embodiment, the first conversion module 21 can adopt Automatic Speech Recognition (ASR) technology, which can convert vocabulary content in human speech into computer readable content and input it into a computer, and perform with a computer. Interaction.
语义分析模块22采用人工智能的自然语言处理(Natural Language Processing,NLP)技术,通过NLP技术获取文字信息中的条件语句和执行语句。The semantic analysis module 22 uses artificial intelligence natural language processing (NLP) technology to acquire conditional statements and execution statements in the text information through the NLP technology.
在优选的实施例中,解析单元2还包括:In a preferred embodiment, the parsing unit 2 further includes:
一第二转换模块23,连接语义分析模块22,用于将执行语句转换为相应的音频信号,并输出。A second conversion module 23 is coupled to the semantic analysis module 22 for converting the execution statement into a corresponding audio signal and outputting it.
在本实施例中,第二转换模块23采用TTS(Text To Speech)即将文本转换为语音技术,该技术是人机对话的一部分,通过TTS使机器人能够说话。In this embodiment, the second conversion module 23 uses TTS (Text To Speech) to convert the text into a voice technology, which is part of the human-machine dialogue, and enables the robot to speak through the TTS.
在优选的实施例中,每一条预设条目包括预设条件语句和预设执行语句。In a preferred embodiment, each of the preset entries includes a preset conditional statement and a preset execution statement.
在优选的实施例中,处理单元3根据目标条目中的条件语句,遍历存储单元中的所有预设条目中的预设条件语句,以获取条件语句是否与预设条件语句重复,若不重复,则进行权重运算,并将目标条目存储于存储单元中4以形成新的预设条目,根据预设条目对机器人进行训练;若重复则进行权重运算,并根据权重计算结果进行相应的处理。 In a preferred embodiment, the processing unit 3 traverses the preset conditional statements in all the preset entries in the storage unit according to the conditional statement in the target entry to obtain whether the conditional statement is repeated with the preset conditional statement. If not, Then, a weight operation is performed, and the target item is stored in the storage unit 4 to form a new preset item, and the robot is trained according to the preset item; if it is repeated, the weight calculation is performed, and the corresponding calculation is performed according to the weight calculation result.
在本实施例中,可在追加新知识条目或原有知识条目后,当收到用户的正负反馈时,进行权重运算,整理整个训练知识库,进行压缩等工作,以保证机器人在条件轮询判断时的效率。In this embodiment, after adding a new knowledge item or an original knowledge item, when receiving the positive and negative feedback of the user, performing weight calculation, arranging the entire training knowledge base, performing compression, etc., to ensure that the robot is in the condition wheel. The efficiency of the judgment.
如图2所示,一种通过语音对机器人进行训练的方法,包括下述步骤:As shown in FIG. 2, a method for training a robot by voice includes the following steps:
S1.采集语音信号;S1. collecting a voice signal;
S2.对语音信号进行解析,将语音信号与预设语句进行匹配,获取与预设语句匹配的且与语音信号对应的条件语句,及与语音信号对应的执行语句;S2. parsing the voice signal, matching the voice signal with the preset statement, acquiring a conditional statement matching the preset statement and corresponding to the voice signal, and an execution statement corresponding to the voice signal;
S3.将条件语句与执行语句结合生成一目标条目;S3. Combining the conditional statement with the execution statement to generate a target entry;
S4.根据目标条目进行权重计算,并根据权重计算结果进行相应的处理。S4. Perform weight calculation according to the target item, and perform corresponding processing according to the weight calculation result.
在本实施例中,只需用户输入语音信号即可对机器人进行训练,操作简单,适用范围广且效率高。In this embodiment, the robot can be trained only by inputting a voice signal, and the operation is simple, the scope of application is wide, and the efficiency is high.
如图3所示,在优选的实施例中,步骤S2具体包括:As shown in FIG. 3, in a preferred embodiment, step S2 specifically includes:
S21.将语音信号转换为文字信息;S21. Converting the voice signal into text information;
S22.对文字信息进行解析,将文字信息与预设语句进行匹配,获取与预设语句匹配的且与文字信息对应的条件语句,并识别条件语句是标准式条件语句或反馈式条件语句;S22. Parsing the text information, matching the text information with the preset statement, obtaining a conditional statement matching the preset sentence and corresponding to the text information, and identifying that the conditional statement is a standard conditional statement or a feedback conditional statement;
若条件语句是标准式条件语句,则获取与文件信息对应的执行语句;If the conditional statement is a standard conditional statement, obtaining an execution statement corresponding to the file information;
若条件语句是反馈式条件语句,则进行权重运算,使机器人执行上一次任务的操作。If the conditional statement is a feedback conditional statement, a weighting operation is performed to cause the robot to perform the operation of the previous task.
在本实施例中,将语音信号转换为文字信息可采用自动语音识别(Automatic Speech Recognition,ASR)技术,ASR技术可将人类语音中的词汇内容转换为计算机可读的输入,并且与计算机进行交互。In this embodiment, the voice signal is converted into text information by using Automatic Speech Recognition (ASR) technology, which converts vocabulary content in human speech into computer readable input and interacts with a computer. .
对文字信息进行解析可采用人工智能的自然语言处理(Natural Language Processing,NLP)技术,通过NLP技术获取文字信息中的条件语句和执行语句。The text information can be parsed by the artificial language natural language processing (NLP) technology, and the conditional statements and execution statements in the text information are obtained by the NLP technology.
在优选的实施例中,步骤S2还包括: In a preferred embodiment, step S2 further includes:
S23.将执行语句转换为相应的音频信号,并输出。S23. Convert the execution statement into a corresponding audio signal and output it.
在本实施例中,采用TTS(Text To Speech,将文本转换为语音)技术将执行语句转换为相应的音频信号,该技术是人机对话的一部分,通过TTS使机器人能够说话。In this embodiment, the TTS (Text To Speech) technique is used to convert the execution statement into a corresponding audio signal, which is part of the human-machine dialogue, enabling the robot to speak through the TTS.
在优选的实施例中,每一条预设条目包括预设条件语句和预设执行语句。In a preferred embodiment, each of the preset entries includes a preset conditional statement and a preset execution statement.
如图4所示,在优选的实施例中,步骤S3具体包括:As shown in FIG. 4, in a preferred embodiment, step S3 specifically includes:
S31.根据目标条目中的条件语句,遍历存储单元中的所有预设条目中的预设条件语句;S31. traversing a preset conditional statement in all preset entries in the storage unit according to the conditional statement in the target entry;
S32.获取遍历结果,并进行权重计算S32. Acquire traversal results and perform weight calculation
判断条件语句是否与预设条件语句重复,Determine whether the conditional statement is a duplicate of a preset conditional statement.
若条件语句与预设条件语句不重复,则执行步骤S33;If the conditional statement does not overlap with the preset conditional statement, step S33 is performed;
若条件语句与预设条件语句重复,则执行步骤S34;If the conditional statement is repeated with the preset conditional statement, step S34 is performed;
S33.进行权重运算,并将目标条目存储于存储单元中以形成新的预设条目,根据预设条目对机器人进行训练;S33. Perform a weight operation, and store the target item in the storage unit to form a new preset item, and train the robot according to the preset item;
S34.进行权重运算,并根据权重计算结果进行相应的处理。S34. Perform a weighting operation and perform corresponding processing according to the weight calculation result.
在本实施例中,当机器人在下午的时候听到用户说“你好”时,用户训练机器人回复“XXX(人名),下午好”的训练步骤如下:In this embodiment, when the robot hears the user saying "hello" in the afternoon, the training steps of the user training robot to reply "XXX (person name), good afternoon" are as follows:
A1.用户对机器人说“你好”,“这个时候应该说,XXX,下午好”A1. The user said "Hello" to the robot, "This time should say, XXX, good afternoon"
A2.对用户说的内容进行语义解析,分离出说话内容中的执行语句即“说XXX,下午好”,“说”是对应机器人的TTS服务,“XXX”命中当前互动的用户的名字,“下午”命中当前的时间,“XXX,下午好”对应TTS服务的内容;A2. Perform semantic analysis on the content spoken by the user, and separate the execution statement in the spoken content, that is, “say XXX, good afternoon”, “say” is the TTS service corresponding to the robot, and “XXX” hits the name of the currently interacting user. In the afternoon, hit the current time, "XXX, good afternoon" corresponds to the content of the TTS service;
A3.根据语义解析结果生成新的知识库条目,判断权重后追加到本地知识库;A3. Generate a new knowledge base entry based on the semantic analysis result, and then add the weight to the local knowledge base after determining the weight;
A4.机器人执行新的知识库,结束;A4. The robot executes a new knowledge base and ends;
在完成这次互动训练后,当用户对机器人说“你好”时,机器人就会回 答“XXX,下午好”,从而达到预期训练目的。After completing this interactive training, when the user says "hello" to the robot, the robot will return Answer "XXX, good afternoon" to achieve the expected training objectives.
本发明可使用户在训练机器人时解放双手,在不需要写任何代码的情况下,实现对机器人行为的修正,使用户在训练过程中更专注于训练内容本身,而非如何编写代码等基础问题上。The invention enables the user to free hands when training the robot, and realizes the modification of the robot behavior without writing any code, so that the user concentrates more on the training content itself in the training process, instead of the basic problem of how to write the code. on.
以上所述仅为本发明较佳的实施例,并非因此限制本发明的实施方式及保护范围,对于本领域技术人员而言,应当能够意识到凡运用本发明说明书及图示内容所作出的等同替换和显而易见的变化所得到的方案,均应当包含在本发明的保护范围内。 The above is only a preferred embodiment of the present invention, and is not intended to limit the scope of the embodiments and the scope of the present invention, and those skilled in the art should be able to Alternatives and obvious variations are intended to be included within the scope of the invention.

Claims (10)

  1. 一种通过语音对机器人进行训练的系统,其特征在于,包括:A system for training a robot by voice, comprising:
    一接收单元,用于接收语音信号;a receiving unit, configured to receive a voice signal;
    一解析单元,连接所述接收单元,用于对所述语音信号进行解析,将所述语音信号与预设语句进行匹配,获取与所述预设语句匹配的且与所述语音信号对应的条件语句,及与所述语音信号对应的执行语句;An analyzing unit, configured to connect the receiving unit, to parse the voice signal, match the voice signal with a preset statement, and acquire a condition that matches the preset statement and corresponds to the voice signal a statement, and an execution statement corresponding to the voice signal;
    一处理单元,连接所述解析单元,用于将所述条件语句与所述执行语句结合生成一目标条目;a processing unit, coupled to the parsing unit, configured to combine the conditional statement with the execution statement to generate a target entry;
    一存储单元,连接所述处理单元,用以存储预设条目,根据所述预设条目对机器人进行训练;a storage unit, connected to the processing unit, for storing a preset item, and training the robot according to the preset item;
    所述处理单元根据所述目标条目进行权重计算,并根据所述权重计算结果进行相应的处理。The processing unit performs weight calculation according to the target item, and performs corresponding processing according to the weight calculation result.
  2. 如权利要求1所述的通过语音对机器人进行训练的系统,其特征在于,所述解析单元包括:The system for training a robot by voice according to claim 1, wherein the parsing unit comprises:
    一第一转换模块,用于将所述语音信号转换为文字信息;a first conversion module, configured to convert the voice signal into text information;
    一语义分析模块,连接所述第一转换模块,用于对所述文字信息进行解析,将所述文字信息与所述预设语句进行匹配,获取与所述预设语句匹配的且与所述文字信息对应的条件语句,并识别所述条件语句是标准式条件语句或反馈式条件语句;a semantic analysis module, configured to connect the first conversion module, to parse the text information, match the text information with the preset statement, and obtain a match with the preset statement and a conditional statement corresponding to the text information, and identifying that the conditional statement is a standard conditional statement or a feedback conditional statement;
    若所述条件语句是标准式条件语句,则获取与所述文件信息对应的执行语句;If the conditional statement is a standard conditional statement, acquiring an execution statement corresponding to the file information;
    若所述条件语句是反馈式条件语句,则进行权重运算,使所述机器人执行上一次任务的操作。If the conditional statement is a feedback conditional statement, a weighting operation is performed to cause the robot to perform an operation of the last task.
  3. 如权利要求2所述的通过语音对机器人进行训练的系统,其特征在于,所述解析单元还包括: The system for training a robot by voice according to claim 2, wherein the parsing unit further comprises:
    一第二转换模块,连接所述语义分析模块,用于将所述执行语句转换为相应的音频信号,并输出。A second conversion module is connected to the semantic analysis module for converting the execution statement into a corresponding audio signal and outputting.
  4. 如权利要求1所述的通过语音对机器人进行训练的系统,其特征在于,每一条所述预设条目包括预设条件语句和预设执行语句。The system for training a robot by voice according to claim 1, wherein each of said preset entries comprises a preset conditional statement and a preset execution statement.
  5. 如权利要求4所述的通过语音对机器人进行训练的系统,其特征在于,所述处理单元根据所述目标条目中的所述条件语句,遍历所述存储单元中的所有所述预设条目中的所述预设条件语句,以获取所述条件语句是否与所述预设条件语句重复,若不重复,则进行所述权重运算,并将所述目标条目存储于所述存储单元中以形成新的所述预设条目,根据所述预设条目对机器人进行训练;若重复则进行所述权重运算,并根据所述权重计算结果进行相应的处理。A system for training a robot by voice according to claim 4, wherein said processing unit traverses all of said preset entries in said storage unit according to said conditional statement in said target entry The preset conditional statement to obtain whether the conditional statement is repeated with the preset conditional statement, if not, performing the weighting operation, and storing the target item in the storage unit to form The new preset item is trained according to the preset item; if it is repeated, the weighting operation is performed, and corresponding processing is performed according to the weight calculation result.
  6. 一种通过语音对机器人进行训练的方法,其特征在于,包括下述步骤:A method for training a robot by voice, characterized in that it comprises the following steps:
    S1.采集语音信号;S1. collecting a voice signal;
    S2.对所述语音信号进行解析,将所述语音信号与预设语句进行匹配,获取与所述预设语句匹配的且与所述语音信号对应的条件语句,及与所述语音信号对应的执行语句;S2. Parsing the voice signal, matching the voice signal with a preset statement, acquiring a conditional statement matching the preset statement and corresponding to the voice signal, and corresponding to the voice signal Execute statement
    S3.将所述条件语句与所述执行语句结合生成一目标条目;S3. Combining the conditional statement with the execution statement to generate a target entry;
    S4.根据所述目标条目进行权重计算,并根据所述权重计算结果进行相应的处理。S4. Perform weight calculation according to the target item, and perform corresponding processing according to the weight calculation result.
  7. 如权利要求6所述通过语音对机器人进行训练的方法,其特征在于,所述步骤S2具体包括:The method of training a robot by voice according to claim 6, wherein the step S2 specifically includes:
    S21.将所述语音信号转换为文字信息;S21. Converting the voice signal into text information;
    S22.对所述文字信息进行解析,将所述文字信息与所述预设语句进行匹配,获取与所述预设语句匹配的且与所述文字信息对应的条件语句,并识别所述条件语句是标准式条件语句或反馈式条件语句;S22. Parsing the text information, matching the text information with the preset statement, acquiring a conditional statement matching the preset statement and corresponding to the text information, and identifying the conditional statement Is a standard conditional statement or a feedback conditional statement;
    若所述条件语句是标准式条件语句,则获取与所述文件信息对应的执行 语句;If the conditional statement is a standard conditional statement, obtaining execution corresponding to the file information Statement
    若所述条件语句是反馈式条件语句,则进行权重运算,使所述机器人执行上一次任务的操作。If the conditional statement is a feedback conditional statement, a weighting operation is performed to cause the robot to perform an operation of the last task.
  8. 如权利要求7所述通过语音对机器人进行训练的方法,其特征在于,所述步骤S2还包括:The method of training a robot by voice according to claim 7, wherein the step S2 further comprises:
    S23.将所述执行语句转换为相应的音频信号,并输出。S23. Convert the execution statement into a corresponding audio signal and output.
  9. 如权利要求6所述通过语音对机器人进行训练的方法,其特征在于,每一条所述预设条目包括预设条件语句和预设执行语句。A method for training a robot by voice according to claim 6, wherein each of said preset entries comprises a preset conditional statement and a preset execution statement.
  10. 如权利要求9所述通过语音对机器人进行训练的方法,其特征在于,所述步骤S3具体包括:The method of training a robot by voice according to claim 9, wherein the step S3 specifically includes:
    S31.根据所述目标条目中的所述条件语句,遍历所述存储单元中的所有所述预设条目中的所述预设条件语句;S31. Traverse the preset conditional statement in all the preset entries in the storage unit according to the conditional statement in the target entry;
    S32.获取遍历结果,判断所述条件语句是否与所述预设条件语句重复,S32. Acquire a traversal result, and determine whether the conditional statement is repeated with the preset conditional statement.
    若所述条件语句与所述预设条件语句不重复,则执行步骤S33;If the conditional statement does not overlap with the preset conditional statement, step S33 is performed;
    若所述条件语句与所述预设条件语句重复,则执行步骤S34;If the conditional statement is repeated with the preset conditional statement, step S34 is performed;
    S33.进行所述权重运算,并将所述目标条目存储于所述存储单元中以形成新的所述预设条目,根据所述预设条目对机器人进行训练;S33. Perform the weighting operation, and store the target entry in the storage unit to form a new preset entry, and train the robot according to the preset entry;
    S34.进行所述权重运算,并根据所述权重计算结果进行相应的处理。 S34. Perform the weighting operation, and perform corresponding processing according to the weighting calculation result.
PCT/CN2016/085911 2015-06-30 2016-06-15 System and method for training robot via voice WO2017000786A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510383547.9A CN106326208B (en) 2015-06-30 2015-06-30 A kind of system and method that robot is trained by voice
CN201510383547.9 2015-06-30

Publications (1)

Publication Number Publication Date
WO2017000786A1 true WO2017000786A1 (en) 2017-01-05

Family

ID=57607864

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/085911 WO2017000786A1 (en) 2015-06-30 2016-06-15 System and method for training robot via voice

Country Status (4)

Country Link
CN (1) CN106326208B (en)
HK (1) HK1231592A1 (en)
TW (1) TWI594136B (en)
WO (1) WO2017000786A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021110870A1 (en) 2019-12-05 2021-06-10 Acib Gmbh Method for producing a fermentation product

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108172226A (en) * 2018-01-27 2018-06-15 上海萌王智能科技有限公司 A kind of voice control robot for learning response voice and action

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075435A (en) * 2007-04-19 2007-11-21 深圳先进技术研究院 Intelligent chatting system and its realizing method
US20110144988A1 (en) * 2009-12-11 2011-06-16 Jongsuk Choi Embedded auditory system and method for processing voice signal
CN102254555A (en) * 2010-05-21 2011-11-23 索尼电脑娱乐公司 Speech recognition
CN202736475U (en) * 2011-12-08 2013-02-13 华南理工大学 Chat robot
CN103065629A (en) * 2012-11-20 2013-04-24 广东工业大学 Speech recognition system of humanoid robot

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060100855A1 (en) * 2004-10-27 2006-05-11 Rozen Suan D Disambiguation method for complex sentences
US7646857B2 (en) * 2005-05-19 2010-01-12 Verizon Business Global Llc Systems and methods for providing voicemail services including caller identification
CN103853646A (en) * 2012-12-04 2014-06-11 鸿富锦精密工业(武汉)有限公司 Called prompting system and method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075435A (en) * 2007-04-19 2007-11-21 深圳先进技术研究院 Intelligent chatting system and its realizing method
US20110144988A1 (en) * 2009-12-11 2011-06-16 Jongsuk Choi Embedded auditory system and method for processing voice signal
CN102254555A (en) * 2010-05-21 2011-11-23 索尼电脑娱乐公司 Speech recognition
CN202736475U (en) * 2011-12-08 2013-02-13 华南理工大学 Chat robot
CN103065629A (en) * 2012-11-20 2013-04-24 广东工业大学 Speech recognition system of humanoid robot

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021110870A1 (en) 2019-12-05 2021-06-10 Acib Gmbh Method for producing a fermentation product

Also Published As

Publication number Publication date
CN106326208B (en) 2019-06-07
CN106326208A (en) 2017-01-11
TWI594136B (en) 2017-08-01
TW201719452A (en) 2017-06-01
HK1231592A1 (en) 2017-12-22

Similar Documents

Publication Publication Date Title
KR102170563B1 (en) Human machine interactive method and apparatus based on artificial intelligence
TWI576825B (en) A voice recognition system of a robot system ?????????????????????????????? and method thereof
KR100679043B1 (en) Apparatus and method for spoken dialogue interface with task-structured frames
KR101211796B1 (en) Apparatus for foreign language learning and method for providing foreign language learning service
CN104538031B (en) Intelligent sound service development cloud platform and method
CN111402861B (en) Voice recognition method, device, equipment and storage medium
KR20160077190A (en) Natural expression processing method, processing and response method, device, and system
JP2017107078A (en) Voice interactive method, voice interactive device, and voice interactive program
US11393455B2 (en) Methods for natural language model training in natural language understanding (NLU) systems
US11626103B2 (en) Methods for natural language model training in natural language understanding (NLU) systems
US10607504B1 (en) Computer-implemented systems and methods for a crowd source-bootstrapped spoken dialog system
JP2014106523A (en) Voice input corresponding device and voice input corresponding program
CN105788596A (en) Speech recognition television control method and system
US20210271816A1 (en) Methods for natural language model training in natural language understanding (nlu) systems
US20210271819A1 (en) Methods for natural language model training in natural language understanding (nlu) systems
CN116821290A (en) Multitasking dialogue-oriented large language model training method and interaction method
TWI594136B (en) A system and method for training robots through voice
JP6306447B2 (en) Terminal, program, and system for reproducing response sentence using a plurality of different dialogue control units simultaneously
KR102564008B1 (en) Device and Method of real-time Speech Translation based on the extraction of translation unit
KR20210059995A (en) Method for Evaluating Foreign Language Speaking Based on Deep Learning and System Therefor
CN110809796B (en) Speech recognition system and method with decoupled wake phrases
KR102181583B1 (en) System for voice recognition of interactive robot and the method therof
US11176933B1 (en) Precomputed communication parameters
CN113539245B (en) Language model automatic training method and system
JP7363307B2 (en) Automatic learning device and method for recognition results in voice chatbot, computer program and recording medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16817150

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16817150

Country of ref document: EP

Kind code of ref document: A1