WO2021097822A1 - 一种可语言交互的机器人 - Google Patents
一种可语言交互的机器人 Download PDFInfo
- Publication number
- WO2021097822A1 WO2021097822A1 PCT/CN2019/120347 CN2019120347W WO2021097822A1 WO 2021097822 A1 WO2021097822 A1 WO 2021097822A1 CN 2019120347 W CN2019120347 W CN 2019120347W WO 2021097822 A1 WO2021097822 A1 WO 2021097822A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- module
- voice
- robot
- user
- voice recognition
- Prior art date
Links
- 230000003993 interaction Effects 0.000 title claims abstract description 13
- 230000000875 corresponding effect Effects 0.000 claims abstract description 15
- 238000001914 filtration Methods 0.000 claims abstract description 9
- 230000002452 interceptive effect Effects 0.000 description 2
- 238000000034 method Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Definitions
- the invention relates to a robot capable of language interaction.
- Robot is a means for automatically performing work machine, may be assisted or substituted by human work, the language may be a voice interactive robot can receive the password corresponding to the operation performed, but the language may be a voice interactive robots prior designs, the input The password must conform to the voice rules established by the robot to achieve the corresponding work. The flexibility is not high and cannot meet the individual customization of different users.
- the present invention provides a robot capable of language interaction.
- the user can customize the voice of the corresponding action according to his own hobbies and habits, with high flexibility and meeting the personalized customization of different users.
- a robot capable of language interaction including a robot body, is characterized in that it also includes a voice recognition module, a filter matching module, a custom voice recognition module, an association module, an action collection module, and an execution module that are sequentially connected, wherein:
- Voice recognition module used to receive the user's voice password
- Filtering and matching module filter the voice password and match it with the voice password in the custom voice recognition module;
- Custom voice recognition module used to store user-defined voice passwords
- Association module used to manually associate user-defined voice passwords with corresponding action instructions
- Action set module store the action instruction set that can be directly recognized by the robot
- Execution module Make the robot body execute the corresponding action.
- it also includes an alarm module connected to the filtering and matching module, and when the user's voice password cannot be matched with the voice password in the custom voice recognition module, an alarm will be issued.
- it further includes a log recording module for recording the action log of the execution module.
- it also includes a pause module for manually suspending the actions of the execution module.
- the beneficial effect of the present invention is that the user can customize the voice to start the corresponding action according to his own hobbies and habits and store it in the custom voice recognition module, and can select the voice that is easy to recognize, such as a single digital voice, which reduces the chance of matching errors. High flexibility, can meet the personalized customization of different users.
- Fig. 1 is a structural block diagram of a robot capable of language interaction according to the present invention.
- a robot capable of language interaction includes a robot body, as shown in Figure 1, and also includes a voice recognition module, a filter matching module, a custom voice recognition module, an association module, an action collection module, and an execution module that are sequentially connected, wherein:
- Voice recognition module used to receive the user's voice password
- Filtering and matching module filter the voice password and match it with the voice password in the custom voice recognition module;
- Custom voice recognition module used to store user-defined voice passwords
- Association module used to manually associate user-defined voice passwords with corresponding action instructions
- Action set module Stores the set of action instructions that the robot can directly recognize, that is, the action instructions that the robot can execute;
- Execution module Make the robot body execute the corresponding action.
- it also includes an alarm module connected to the filtering and matching module, and when the user's voice password cannot be matched with the voice password in the custom voice recognition module, an alarm will be issued.
- it further includes a log recording module for recording the action log of the execution module.
- it also includes a pause module for manually suspending the action of the execution module.
- a pause module for manually suspending the action of the execution module. For example, when the user finds that the matching result is incorrect or wants to suspend the action, he can manually pause, or enter the voice password corresponding to the pause in advance to realize automatic pause.
- the number of times and percentage of the user's voice password is counted.
- the number and percentage of passwords that are manually suspended are counted.
- the user can customize the voice to start the corresponding action according to his own hobbies and habits and store it in the custom voice recognition module. You can choose a voice that is easy to recognize. For example, a single digital voice corresponds to a common action command, reducing the chance of matching errors and being flexible. High performance, can meet the individual customization of different users.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Manipulator (AREA)
Abstract
一种可语言交互的机器人,包括机器人本体,其特征在于,还包括顺次相连的语音识别模块、过滤匹配模块、自定义语音识别模块、关联模块、动作集合模块和执行模块,其中:语音识别模块:用于接收用户的语音口令;过滤匹配模块:对语音口令进行过滤并与自定义语音识别模块内的语音口令进行匹配;自定义语音识别模块:用于存储用户自定义的语音口令;关联模块:用于手动关联用户自定义的语音口令与对应的动作指令;动作集合模块:存储机器人可直接识别的动作指令集合;执行模块:使机器人本体执行对应的动作。用户可以根据自身爱好和习惯自定义启动对应动作的语音,灵活性高,满足不同用户的个性定制。
Description
本发明涉及一种可语言交互的机器人。
机器人(Robot)是自动执行工作的机器
装置,可以协助或取代人类的工作,可语言交互的机器人即可接收语音口令执行对应的动作,但现有设计的可语言交互的机器人,输入的语音口令必须符合机器人制定的语音规则才可以实现对应的工作,灵活性不高,无法满足不同用户的个性定制。
发明内容
针对上述问题,本发明提供一种可语言交互的机器人,用户可以根据自身爱好和习惯自定义启动对应动作的语音,灵活性高,满足不同用户的个性定制。
为实现上述技术目的,达到上述技术效果,本发明通过以下技术方案实现:
一种可语言交互的机器人,包括机器人本体,其特征在于,还包括顺次相连的语音识别模块、过滤匹配模块、自定义语音识别模块、关联模块、动作集合模块和执行模块,其中:
语音识别模块:用于接收用户的语音口令;
过滤匹配模块:对语音口令进行过滤并与自定义语音识别模块内的语音口令进行匹配;
自定义语音识别模块:用于存储用户自定义的语音口令;
关联模块:用于手动关联用户自定义的语音口令与对应的动作指 令;
动作集合模块:存储机器人可直接识别的动作指令集合;
执行模块:使机器人本体执行对应的动作。
优选,还包括与过滤匹配模块相连的报警模块,当无法匹配用户的语音口令与自定义语音识别模块内的语音口令时,则发出报警提示。
优选,还包括日志记录模块,用于记录执行模块的动作日志。
优选,还包括暂停模块,用于手动暂停执行模块的动作。
本发明的有益效果是:用户可以根据自身爱好和习惯自定义启动对应动作的语音并存储在自定义语音识别模块里,可以选择容易识别的语音,比如单个的数字语音,减少匹配失误的几率,灵活性高,可满足不同用户的个性定制。
图1是本发明一种可语言交互的机器人的结构框图。
下面结合附图和具体的实施例对本发明技术方案作进一步的详细描述,以使本领域的技术人员可以更好的理解本发明并能予以实施,但所举实施例不作为对本发明的限定。
一种可语言交互的机器人,包括机器人本体,如图1所示,还包括顺次相连的语音识别模块、过滤匹配模块、自定义语音识别模块、关联模块、动作集合模块和执行模块,其中:
语音识别模块:用于接收用户的语音口令;
过滤匹配模块:对语音口令进行过滤并与自定义语音识别模块内 的语音口令进行匹配;
自定义语音识别模块:用于存储用户自定义的语音口令;
关联模块:用于手动关联用户自定义的语音口令与对应的动作指令;
动作集合模块:存储机器人可直接识别的动作指令集合,也即机器人可以执行的动作指令;
执行模块:使机器人本体执行对应的动作。
优选,还包括与过滤匹配模块相连的报警模块,当无法匹配用户的语音口令与自定义语音识别模块内的语音口令时,则发出报警提示。
优选,还包括日志记录模块,用于记录执行模块的动作日志。
优选,还包括暂停模块,用于手动暂停执行模块的动作,比如,当用户发现匹配结果不正确或者想暂停动作时,均可以手动暂停,或者提前录入暂停对应的语音口令实现自动暂停。
优选,对用户的语音口令进行次数和所占百分比进行统计。
优选,对手动暂停执行的口令的次数和所占百分比进行统计。
用户可以根据自身爱好和习惯自定义启动对应动作的语音并存储在自定义语音识别模块里,可以选择容易识别的语音,比如单个的数字语音对应一个常用的动作指令,减少匹配失误的几率,灵活性高,可满足不同用户的个性定制。
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或者等效流程 变换,或者直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。
Claims (6)
- 一种可语言交互的机器人,包括机器人本体,其特征在于,还包括顺次相连的语音识别模块、过滤匹配模块、自定义语音识别模块、关联模块、动作集合模块和执行模块,其中:语音识别模块:用于接收用户的语音口令;过滤匹配模块:对语音口令进行过滤并与自定义语音识别模块内的语音口令进行匹配;自定义语音识别模块:用于存储用户自定义的语音口令;关联模块:用于手动关联用户自定义的语音口令与对应的动作指令;动作集合模块:存储机器人可直接识别的动作指令集合;执行模块:使机器人本体执行对应的动作。
- 根据权利要求1所述的一种可语言交互的机器人,其特征在于,还包括与过滤匹配模块相连的报警模块,当无法匹配用户的语音口令与自定义语音识别模块内的语音口令时,则发出报警提示。
- 根据权利要求1所述的一种可语言交互的机器人,其特征在于,还包括日志记录模块,用于记录执行模块的动作日志。
- 根据权利要求3所述的一种可语言交互的机器人,其特征在于,还包括暂停模块,用于手动暂停执行模块的动作。
- 根据权利要求3所述的一种可语言交互的机器人,其特征在于,对用户的语音口令进行次数和所占百分比进行统计。
- 根据权利要求4所述的一种可语言交互的机器人,其特征在于,对手动暂停执行的口令的次数和所占百分比进行统计。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/120347 WO2021097822A1 (zh) | 2019-11-22 | 2019-11-22 | 一种可语言交互的机器人 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2019/120347 WO2021097822A1 (zh) | 2019-11-22 | 2019-11-22 | 一种可语言交互的机器人 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021097822A1 true WO2021097822A1 (zh) | 2021-05-27 |
Family
ID=75981154
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/120347 WO2021097822A1 (zh) | 2019-11-22 | 2019-11-22 | 一种可语言交互的机器人 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2021097822A1 (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106023989A (zh) * | 2016-05-18 | 2016-10-12 | 苏州铭冠软件科技有限公司 | 一种可语言交互的机器人 |
CN106557164A (zh) * | 2016-11-18 | 2017-04-05 | 北京光年无限科技有限公司 | 应用于智能机器人的多模态输出方法和装置 |
US20170264451A1 (en) * | 2014-09-16 | 2017-09-14 | Zte Corporation | Intelligent Home Terminal and Control Method of Intelligent Home Terminal |
CN108877796A (zh) * | 2018-06-14 | 2018-11-23 | 合肥品冠慧享家智能家居科技有限责任公司 | 语音控制智能设备终端操作的方法和装置 |
-
2019
- 2019-11-22 WO PCT/CN2019/120347 patent/WO2021097822A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170264451A1 (en) * | 2014-09-16 | 2017-09-14 | Zte Corporation | Intelligent Home Terminal and Control Method of Intelligent Home Terminal |
CN106023989A (zh) * | 2016-05-18 | 2016-10-12 | 苏州铭冠软件科技有限公司 | 一种可语言交互的机器人 |
CN106557164A (zh) * | 2016-11-18 | 2017-04-05 | 北京光年无限科技有限公司 | 应用于智能机器人的多模态输出方法和装置 |
CN108877796A (zh) * | 2018-06-14 | 2018-11-23 | 合肥品冠慧享家智能家居科技有限责任公司 | 语音控制智能设备终端操作的方法和装置 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9424845B2 (en) | Speaker verification in a health monitoring system | |
AU2018403182B2 (en) | Computing devices with improved interactive animated conversational interface systems | |
JP6859501B2 (ja) | 通信ソフトウェアにおいて音声により業務を起動する方法及びこれに対応する装置 | |
EP3451195A1 (en) | Music recommending method and apparatus, device and storage medium | |
RU2653283C2 (ru) | Способ диалога между машиной, такой как гуманоидный робот, и собеседником-человеком, компьютерный программный продукт и гуманоидный робот для осуществления такого способа | |
US20140115456A1 (en) | System for accessing software functionality | |
WO2017059815A1 (zh) | 一种快速识别方法及家庭智能机器人 | |
WO2015068699A1 (ja) | エンタテインメント装置、表示制御方法、プログラム及び情報記憶媒体 | |
CN110476150A (zh) | 用于操作语音辨识服务的方法和支持其的电子装置 | |
JPH03163623A (ja) | 音声制御コンピュータ・インターフェース | |
CN109710727A (zh) | 用于自然语言处理的系统和方法 | |
US20140335826A1 (en) | Method and apparatus for unlocking a terminal device | |
JP7016499B2 (ja) | チャットボットを用いたユーザーケアシステム | |
CN106175727B (zh) | 一种应用于可穿戴设备的表情推送方法及可穿戴设备 | |
WO2017215186A1 (zh) | 一种安全登录方法和装置、存储介质 | |
WO2021097822A1 (zh) | 一种可语言交互的机器人 | |
CN109166584A (zh) | 语音控制方法、装置、呼吸机和存储介质 | |
CN105913842A (zh) | 一种语音自定义唤醒手机的方法 | |
US10381005B2 (en) | Systems and methods for determining user frustration when using voice control | |
CN103581726A (zh) | 一种电视设备上采用语音实现游戏控制的方法 | |
WO2016206187A1 (zh) | 一种终端控制方法、装置、终端、及计算机存储介质 | |
CN106023989A (zh) | 一种可语言交互的机器人 | |
TWI594857B (zh) | 一種對機器人進行訓練的系統及方法 | |
WO2018006367A1 (zh) | 游戏中基于多模态输入的道具购买方法及系统 | |
Hongru et al. | Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19953402 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19953402 Country of ref document: EP Kind code of ref document: A1 |