WO2020199963A1 - Procédé pour robot de commande d'aliment pour identifier l'intention de commande d'aliment d'un utilisateur, et robot - Google Patents
Procédé pour robot de commande d'aliment pour identifier l'intention de commande d'aliment d'un utilisateur, et robot Download PDFInfo
- Publication number
- WO2020199963A1 WO2020199963A1 PCT/CN2020/080725 CN2020080725W WO2020199963A1 WO 2020199963 A1 WO2020199963 A1 WO 2020199963A1 CN 2020080725 W CN2020080725 W CN 2020080725W WO 2020199963 A1 WO2020199963 A1 WO 2020199963A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- ordering
- intention
- user
- robot
- processor
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 43
- 235000013305 food Nutrition 0.000 title abstract description 17
- 238000004891 communication Methods 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims description 5
- 238000012545 processing Methods 0.000 claims description 4
- 238000012790 confirmation Methods 0.000 claims 1
- 238000012549 training Methods 0.000 abstract description 13
- 230000003993 interaction Effects 0.000 abstract description 3
- 230000006870 function Effects 0.000 description 11
- 235000012149 noodles Nutrition 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 235000012054 meals Nutrition 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000010295 mobile communication Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000013145 classification model Methods 0.000 description 1
- 238000013499 data model Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000002716 delivery method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/221—Announcement of recognition results
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
Definitions
- the present disclosure relates to the technical field of robot intention recognition, and in particular, to a method and a robot for an ordering robot to recognize a user's ordering intention.
- the application publication number is CN 109146717 A and the publication date of the application publication date is 2019.01.04 discloses a method for self-service ordering and delivery of food in an unmanned restaurant; the method includes: the user obtains it through his own personal mobile terminal The identification information provided by the dining table; the user opens the login interface of the ordering system through his own personal mobile terminal and logs into the ordering system; the ordering system authenticates the received identification information, and the ordering system provides the ordering interface to the user The user orders food and forms an order in the ordering system; the ordering system automatically shares the information with the restaurant robot, which receives the order and identification information, and picks up the food in the back kitchen according to the order of the order; The robot locates the table position according to the position information of the table in the identification information, and delivers the meal ordered by the user to the designated meal. ; This disclosure realizes the formation of intelligent automatic management between the user order and the meal, reduces the number of restaurant waiters, and helps reduce restaurant operating costs.
- this unmanned restaurant self-service ordering and delivery method does not have a model training function, which makes it difficult to configure voice conversion, and the machine often fails to recognize the ordering intention.
- the present disclosure provides a method and a robot for an ordering robot to recognize a user’s ordering intention, which can effectively overcome the existing technology that does not have a model training function, resulting in voice conversion. Configuration is difficult, and the machine often fails to recognize the defects of ordering intentions.
- the present disclosure provides a method for a food ordering robot to recognize a user's ordering intention, which includes the following steps:
- the ordering robot recognizes the user's ordering intention expressed by voice through the intention recognition model.
- the present disclosure also provides an ordering robot using the above method for recognizing a user’s ordering intention.
- the ordering robot is provided with a controller, a wireless communication module, a voice recognition module, a speaker and a display screen; wherein, wireless communication
- the module, voice recognition module, speaker and display screen are all electrically connected to the controller.
- the present disclosure also provides an electronic device, which includes:
- At least one processor and,
- a memory communicatively connected with the at least one processor; wherein,
- the memory stores instructions that can be executed by the at least one processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute any of the foregoing first aspect or any implementation of the first aspect A method for the ordering robot to recognize the user's ordering intention.
- the present disclosure also provides a non-transitory computer-readable storage medium, the non-transitory computer-readable storage medium stores at least one executable instruction, and the executable instruction is used to make a processor execute the aforementioned first aspect or the first A method for the ordering robot in any implementation manner of the aspect to recognize the user's ordering intention.
- the present disclosure also provides a computer program product.
- the computer program product includes a calculation program stored on a non-transitory computer-readable storage medium.
- the computer program includes program instructions that, when executed by a processor, cause the The processor executes the method for the ordering robot in the foregoing first aspect or any implementation of the first aspect to recognize the user's ordering intention.
- the present disclosure provides a method and robot for an ordering robot to recognize the user's ordering intention, and the beneficial effect produced is: creatively applying the method of natural language text classification to the ordering robot
- the field of intention recognition in the ordering stage greatly improves the robot's intelligence. It does not need to be divided into verbs and nouns manually. It has a model training function. After the model is trained, the corresponding intention can be accurately identified for different sentences. Greatly increase the flexibility and simplicity of configuration and optimize the user experience.
- Figure 1 is a schematic flow diagram of the disclosed method
- FIG. 2 is a schematic diagram of the system structure of the ordering robot of the present disclosure
- FIG. 3 is a schematic structural diagram of an electronic device 30 provided by an embodiment of the disclosure.
- a method for an ordering robot to recognize a user's ordering intention includes the following steps:
- Step 1 Intent preparation.
- the developer summarizes the intent of ordering according to business characteristics
- Step 2 Data preparation, summarizing and calibrating the intent of daily ordering terms
- Step three model training, select and train an ordering intention recognition model according to the number of intentions and the scale of training data
- Step 4 online application, the ordering robot obtains the voice information of the user's order, converts the text through voice recognition technology, enters the text into the intention recognition model, obtains the probability corresponding to each intention, and applies the appropriate probability threshold set according to the business characteristics Determine the intention of the conversation;
- Step five online optimization, for the situation where the user’s ordering intention cannot be identified through comparison or the identified ordering intention is wrong, training data is added, and steps 3 and 4 are repeated to make the intention recognition more and more accurate.
- the user's ordering intention cannot be recognized, that is, the online intention recognition score is low, and the preset probability threshold is not reached.
- the ordering robot in step 4 is equipped with a controller, the controller model is STM32F103, and the controller is powered by a small battery; the ordering robot in step 4 is equipped with a wireless communication module, and the wireless communication module model is URS-GPRS- 730, the wireless communication module is electrically connected to the controller; the ordering robot in step 4 is equipped with a voice recognition module, the model of the speech recognition module is LD3320, and the speech recognition module is electrically connected to the controller; the ordering robot in step 4 is equipped with The speaker for the voice feedback information is electrically connected to the controller; the ordering robot in step 4 is provided with a display screen for confirming the ordering information for customers, and the display screen is electrically connected to the controller.
- the developer summarizes the intent of ordering according to the business characteristics and completes the intent preparation; then summarizes and calibrates the intent of the daily ordering language to complete the data preparation; secondly, select the appropriate machine learning according to the number of intents and the scale of training data Model, train the intention classification model, and complete the model training; again, the ordering robot obtains the voice information of the user’s order, converts the voice signal into text through the voice recognition module, and enters the text into the intention recognition model to obtain the probability corresponding to each intent.
- the controller controls the wireless communication module to send the ordering information to the background for processing To complete the ordering work;
- the intent recognition scheme of the existing general platform is to configure a certain intent.
- it is necessary to configure the verb and noun corresponding to the intent such as "lai a bowl of noodles" in the intent of ordering.
- the verb "lai” and nouns need to be configured during configuration.
- "A bowl of noodles” is extremely inconvenient to configure.
- One more word in the middle cannot be recognized.
- This disclosure only needs to tell the model that "coming a bowl of noodles” is an ordering intention during the data preparation stage, and does not need to be manually divided into
- the verb "lai” and the noun "one bowl of noodles" after training the model, it can accurately identify the corresponding intentions for the different sentences like "lai one noodle", which greatly increases the flexibility and simplicity of configuration, and optimizes Improve the user experience.
- This disclosure has disclosed the models of the controller, the wireless communication module, and the voice recognition module.
- the internal structure and pin functions of the electrical components involved in this disclosure are the common knowledge of those skilled in the art, and those skilled in the art have the ability to establish The circuit connection relationship between the above-mentioned electrical components, in addition, the use of a wireless communication module to establish a wireless connection is common knowledge of those skilled in the art.
- the present disclosure provides a method and a robot for an ordering robot to recognize a user's ordering intention, which produces beneficial effects: creatively applying the method of natural language field text classification to the field of intention recognition at the ordering stage of the ordering robot, It greatly improves the robot's intelligence. It does not need to be manually split into verbs and nouns. It has a model training function. After the model is trained, the corresponding intent can be accurately identified for different sentences, which greatly increases the flexibility and configuration Convenience, optimized user experience.
- the electronic device 30 includes at least one processor 301 (for example, a CPU), at least one input and output interface 304, a memory 302, and at least one communication bus 303 for implementing The connection and communication between these components.
- the at least one processor 301 is configured to execute the computer instructions stored in the memory 302, so that the at least one processor 301 can execute any of the foregoing embodiments of the method for ordering robots to recognize the user's ordering intention.
- the memory 302 is a non-transitory memory (non-transitory memory), which may include volatile memory, such as high-speed random access memory (RAM: Random Access Memory), and may also include non-volatile memory (non-volatile memory) , Such as at least one disk storage.
- volatile memory such as high-speed random access memory (RAM: Random Access Memory)
- non-volatile memory non-volatile memory
- the communication connection with at least one other device or unit is realized through at least one input and output interface 304 (which may be a wired or wireless communication interface).
- the memory 302 stores a program 3021
- the processor 301 executes the program 3021, which is used to execute any of the aforementioned food ordering robots in the method embodiment for recognizing the user's ordering intention.
- the electronic device can exist in many forms, including but not limited to:
- Mobile communication equipment This type of equipment is characterized by mobile communication functions, and its main goal is to provide voice and data communications.
- Such terminals include: smart phones (such as iPhone), multimedia phones, functional phones, and low-end phones.
- Ultra-mobile personal computer equipment This type of equipment belongs to the category of personal computers, has calculation and processing functions, and generally also has mobile Internet features.
- Such terminals include: PDA, MID and UMPC devices, such as iPad.
- Portable entertainment equipment This type of equipment can display and play multimedia content.
- Such devices include: audio, video players (such as iPod), handheld game consoles, e-books, as well as smart toys and portable car navigation devices.
- Specific server a device that provides computing services.
- the composition of a server includes a processor, hard disk, memory, system bus, etc.
- the server is similar to a general computer architecture, but due to the need to provide highly reliable services, it is High requirements in terms of performance, reliability, security, scalability, and manageability.
- the description is relatively simple, and for related parts, please refer to the partial description of the method embodiment.
- a "computer-readable medium” can be any device that can contain, store, communicate, propagate, or transmit a program for use by an instruction execution system, device, or device or in combination with these instruction execution systems, devices, or devices.
- computer readable media include the following: electrical connections (electronic devices) with one or more wiring, portable computer disk cases (magnetic devices), random access memory (RAM), Read only memory (ROM), erasable and editable read only memory (EPROM or flash memory), fiber optic devices, and portable compact disk read only memory (CDROM).
- the computer-readable medium may even be paper or other suitable media on which the program can be printed, because it can be used, for example, by optically scanning the paper or other media, and then editing, interpreting, or other suitable media if necessary. The program is processed in a manner to obtain the program electronically and then stored in the computer memory.
- multiple steps or methods can be implemented by software or firmware stored in a memory and executed by a suitable instruction execution system.
- a logic gate circuit for implementing logic functions on data signals
- Discrete logic circuits application-specific integrated circuits with suitable combinational logic gates
- PGA programmable gate array
- FPGA field programmable gate array
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910250987.5 | 2019-03-29 | ||
CN201910250987.5A CN109979453A (zh) | 2019-03-29 | 2019-03-29 | 一种面向点餐机器人的智能意图识别人机交互方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020199963A1 true WO2020199963A1 (fr) | 2020-10-08 |
Family
ID=67081811
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/080725 WO2020199963A1 (fr) | 2019-03-29 | 2020-03-23 | Procédé pour robot de commande d'aliment pour identifier l'intention de commande d'aliment d'un utilisateur, et robot |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN109979453A (fr) |
WO (1) | WO2020199963A1 (fr) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109979453A (zh) * | 2019-03-29 | 2019-07-05 | 客如云科技(成都)有限责任公司 | 一种面向点餐机器人的智能意图识别人机交互方法 |
CN113326351A (zh) * | 2021-06-17 | 2021-08-31 | 湖北亿咖通科技有限公司 | 一种用户意图确定方法及装置 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2363251A1 (fr) * | 2010-03-01 | 2011-09-07 | Honda Research Institute Europe GmbH | Robot doté de séquences comportementales sur la base de représentations apprise de réseau de Pétri |
CN105426436A (zh) * | 2015-11-05 | 2016-03-23 | 百度在线网络技术(北京)有限公司 | 基于人工智能机器人的信息提供方法和装置 |
CN105488164A (zh) * | 2015-11-30 | 2016-04-13 | 北京光年无限科技有限公司 | 问答数据处理方法及装置、智能机器人 |
CN106205611A (zh) * | 2016-06-29 | 2016-12-07 | 北京智能管家科技有限公司 | 一种基于多模态历史响应结果的人机交互方法及系统 |
CN108932945A (zh) * | 2018-03-21 | 2018-12-04 | 北京猎户星空科技有限公司 | 一种语音指令的处理方法及装置 |
CN109427334A (zh) * | 2017-09-01 | 2019-03-05 | 王阅 | 一种基于人工智能的人机交互方法及系统 |
CN109979453A (zh) * | 2019-03-29 | 2019-07-05 | 客如云科技(成都)有限责任公司 | 一种面向点餐机器人的智能意图识别人机交互方法 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103458056B (zh) * | 2013-09-24 | 2017-04-26 | 世纪恒通科技股份有限公司 | 自动外呼系统基于自动分类技术的语音意图判定系统 |
US20150278370A1 (en) * | 2014-04-01 | 2015-10-01 | Microsoft Corporation | Task completion for natural language input |
CN105786798B (zh) * | 2016-02-25 | 2018-11-02 | 上海交通大学 | 一种人机交互中自然语言意图理解方法 |
CN109087135B (zh) * | 2018-07-25 | 2020-08-28 | 百度在线网络技术(北京)有限公司 | 用户意图的挖掘方法及装置、计算机设备及可读介质 |
CN109087010A (zh) * | 2018-08-09 | 2018-12-25 | 杭州任你说智能科技有限公司 | 一种基于语音和大屏交互的点餐系统 |
-
2019
- 2019-03-29 CN CN201910250987.5A patent/CN109979453A/zh active Pending
-
2020
- 2020-03-23 WO PCT/CN2020/080725 patent/WO2020199963A1/fr active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2363251A1 (fr) * | 2010-03-01 | 2011-09-07 | Honda Research Institute Europe GmbH | Robot doté de séquences comportementales sur la base de représentations apprise de réseau de Pétri |
CN105426436A (zh) * | 2015-11-05 | 2016-03-23 | 百度在线网络技术(北京)有限公司 | 基于人工智能机器人的信息提供方法和装置 |
CN105488164A (zh) * | 2015-11-30 | 2016-04-13 | 北京光年无限科技有限公司 | 问答数据处理方法及装置、智能机器人 |
CN106205611A (zh) * | 2016-06-29 | 2016-12-07 | 北京智能管家科技有限公司 | 一种基于多模态历史响应结果的人机交互方法及系统 |
CN109427334A (zh) * | 2017-09-01 | 2019-03-05 | 王阅 | 一种基于人工智能的人机交互方法及系统 |
CN108932945A (zh) * | 2018-03-21 | 2018-12-04 | 北京猎户星空科技有限公司 | 一种语音指令的处理方法及装置 |
CN109979453A (zh) * | 2019-03-29 | 2019-07-05 | 客如云科技(成都)有限责任公司 | 一种面向点餐机器人的智能意图识别人机交互方法 |
Also Published As
Publication number | Publication date |
---|---|
CN109979453A (zh) | 2019-07-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108000526B (zh) | 用于智能机器人的对话交互方法及系统 | |
US11823659B2 (en) | Speech recognition through disambiguation feedback | |
US11100934B2 (en) | Method and apparatus for voiceprint creation and registration | |
US10360265B1 (en) | Using a voice communications device to answer unstructured questions | |
US11302337B2 (en) | Voiceprint recognition method and apparatus | |
EP3183728B1 (fr) | Système et procédé de détection d'énoncé orphelin | |
CN112071302B (zh) | 计算代理的合成语音选择 | |
US9947317B2 (en) | Pronunciation learning through correction logs | |
CN108133707B (zh) | 一种内容分享方法及系统 | |
US20190164064A1 (en) | Question and answer interaction method and device, and computer readable storage medium | |
CN108520743A (zh) | 智能设备的语音控制方法、智能设备及计算机可读介质 | |
WO2015096564A1 (fr) | Procédé et dispositif de traduction vocale en ligne | |
CN107134279A (zh) | 一种语音唤醒方法、装置、终端和存储介质 | |
WO2021056837A1 (fr) | Plateforme et procédé de personnalisation pour produit d'évaluation de qualité de service | |
WO2020177592A1 (fr) | Procédé et dispositif de réponse à une question de peinture, système de réponse à une question de peinture, et support d'informations lisible | |
WO2020199963A1 (fr) | Procédé pour robot de commande d'aliment pour identifier l'intention de commande d'aliment d'un utilisateur, et robot | |
TW202025139A (zh) | 語音互動方法、裝置及系統 | |
CN108763548A (zh) | 收集训练数据的方法、装置、设备和计算机可读存储介质 | |
TW202022849A (zh) | 語音資料的識別方法、裝置及系統 | |
CN113468894A (zh) | 对话交互方法及装置、电子设备和计算机可读存储介质 | |
CN112767916A (zh) | 智能语音设备的语音交互方法、装置、设备、介质及产品 | |
CN109830232A (zh) | 人机交互方法、装置和存储介质 | |
CN113763925A (zh) | 语音识别方法、装置、计算机设备及存储介质 | |
JP2013037512A (ja) | ソーシャルネットワーキングサービスシステム、ソーシャルネットワーキングサービスサーバ及びソーシャルネットワーキングサービスプログラム | |
WO2020199590A1 (fr) | Procédé d'analyse de détection d'humeur et dispositif associé |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20784430 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20784430 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20784430 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03.05.2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20784430 Country of ref document: EP Kind code of ref document: A1 |