WO2020147380A1 - Human-computer interaction method and apparatus, computing device, and computer-readable storage medium - Google Patents

Human-computer interaction method and apparatus, computing device, and computer-readable storage medium Download PDF

Info

Publication number
WO2020147380A1
WO2020147380A1 PCT/CN2019/116091 CN2019116091W WO2020147380A1 WO 2020147380 A1 WO2020147380 A1 WO 2020147380A1 CN 2019116091 W CN2019116091 W CN 2019116091W WO 2020147380 A1 WO2020147380 A1 WO 2020147380A1
Authority
WO
WIPO (PCT)
Prior art keywords
screen display
question
answer
robot
preset
Prior art date
Application number
PCT/CN2019/116091
Other languages
French (fr)
Chinese (zh)
Inventor
王树良
马世奎
孙文豹
Original Assignee
深圳前海达闼云端智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳前海达闼云端智能科技有限公司 filed Critical 深圳前海达闼云端智能科技有限公司
Publication of WO2020147380A1 publication Critical patent/WO2020147380A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • the embodiments of the present application relate to the field of human-computer interaction technology, and in particular, to a human-computer interaction method, device, computing device, and computer-readable storage medium.
  • Human-computer interaction is a process of information exchange between people and computers using a certain language to complete certain tasks in a certain interactive manner.
  • intelligent technology breakthroughs have been made in the research of intelligent robots, which have been widely used in technical fields such as family life, medical treatment, and industry, and the interaction between humans and robots has become more and more diversified, such as, Text interaction, voice interaction, etc., among them, voice interaction is currently a main way of interaction between humans and robots.
  • this application is proposed to provide a human-computer interaction method, device, computing device, and computer-readable storage medium that overcome the above problems or at least partially solve the above problems.
  • a technical solution adopted in the embodiments of this application is to provide a human-computer interaction method, including obtaining the user's question voice sent by the robot; converting the question voice into question text; The answer to the question that matches the text of the question in the question; judge whether the answer to the question contains a keyword that is preset to trigger a large-screen display; if it does, send the answer to the question to the robot and send a large-screen display to the screen display device Instructions to cause the screen display device to display according to the large-screen display instruction, wherein the large-screen display instruction carries the keyword.
  • the question answer matching the question text in the preset question and answer library includes: using a preset word segmentation algorithm to split the question text into multiple words; searching in the preset question and answer library containing at least one question Calculate the similarity between the sentence and the question text to obtain the similarity value; use the answer corresponding to the sentence with the highest similarity value as the question answer of the question text.
  • the screen display device performs display according to the large-screen display instruction, wherein the carrying of the keyword in the large-screen display instruction includes: searching a preset display library according to the large-screen display instruction to include all The image file of the keyword; display the image file searched.
  • the sending a large-screen display instruction to the screen display device may further include: obtaining the identification number of the robot that sends the question voice; The screen display device sends a large-screen display instruction, wherein the screen display device is associated with the robot identity label.
  • Another technical solution adopted in the embodiments of the present application is to provide a human-computer interaction device, including: an acquisition module: used to acquire the user’s question voice sent by the robot; and a conversion module: used to convert the question voice into question text ; Matching module: used to match the answer to the question text in the preset question and answer library; Judging module: used to judge whether the answer to the question contains a preset keyword that triggers large-screen display; Sending module: Used for When the answer to the question contains a keyword that is preset to trigger a large-screen display, the answer to the question is sent to the robot, and a large-screen display instruction is sent to the screen display device, so that the screen display device responds to the large-screen display The display instruction is displayed, wherein the large-screen display instruction carries the keyword.
  • the matching module includes: a splitting unit: used to split the question text into multiple words using a preset word segmentation algorithm; a search unit: used to search a preset question and answer library that contains at least one of the The sentence of the word; calculation unit: used to calculate the similarity between the sentence and the question text to obtain the similarity value; the determination unit: used to take the answer corresponding to the sentence with the highest similarity value as the question text Answer.
  • a splitting unit used to split the question text into multiple words using a preset word segmentation algorithm
  • a search unit used to search a preset question and answer library that contains at least one of the The sentence of the word
  • calculation unit used to calculate the similarity between the sentence and the question text to obtain the similarity value
  • the determination unit used to take the answer corresponding to the sentence with the highest similarity value as the question text Answer.
  • the sending module includes: a search unit: used to search for an image file containing the keyword in a preset display library according to the large-screen display instruction; a display unit: used to display the searched image file .
  • the sending module further includes:
  • Obtaining unit used to obtain the identity label of the robot that sent the question voice; sending unit: used to send a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.
  • a computing device including: a processor, a memory, a communication interface, and a communication bus.
  • the processor, the memory, and the communication interface are completed through the communication bus.
  • Mutual communication the memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to a human-computer interaction method.
  • Another technical solution adopted in the embodiments of the present application is to provide a computer-readable storage medium in which at least one executable instruction is stored, and the executable instruction causes a processor to execute a corresponding human-computer interaction method. Operation.
  • the embodiment of the present application presets keywords that trigger the large-screen display in the preset question and answer library.
  • the answer to the question is
  • the screen is triggered to display the content related to the keyword, so that in addition to the voice interaction between the robot and the human, the related content can be vividly displayed through the video image; in addition, when there are multiple robots
  • large-screen display equipment by associating the large-screen display device with the robot’s identity label, it is possible to display the video and audio files corresponding to the screen display keywords contained in the question voice sent by the specific robot on the specific large-screen display device , To ensure the security of the content displayed on the screen.
  • FIG. 1 is a flowchart of a human-computer interaction method according to an embodiment of the present application
  • FIG. 2 is a flowchart of a question answer matching question text in a human-computer interaction method according to an embodiment of the present application
  • FIG. 3 is a flowchart of sending a large-screen display instruction to a screen display device in a human-computer interaction method according to another embodiment of the present application;
  • FIG. 4 is a functional block diagram of a human-computer interaction device according to an embodiment of the present application.
  • Fig. 5 is a schematic diagram of a computing device according to an embodiment of the present application.
  • FIG. 1 is a flowchart of an embodiment of a human-computer interaction method according to this application. As shown in Figure 1, the method includes the following steps:
  • Step S101 Obtain the user's question voice sent by the robot.
  • the robot user has a dialogue with the robot, and the dialogue mode can be a mobile terminal APP or a robot body.
  • a robot control unit is arranged inside the robot body, and the robot control unit can receive question voices sent by a user.
  • the robot described here includes a robot body and a robot control unit.
  • Step S102 Convert the question voice into question text.
  • the robot control unit uses a preset voice transmission channel to transmit it to the robot management platform.
  • the preset voice transmission channel is a preset channel dedicated to voice transmission, such as , Zeroc ice channel.
  • the robot management platform uses a preset voice conversion algorithm to convert the question voice into question text.
  • the preset voice conversion algorithm is an existing technology, such as the voice developed by iFlytek Conversion algorithm.
  • Step S103 Match the question answer of the question text in the preset question answering library.
  • the question text is matched with the preset question and answer library to obtain the answer to the question.
  • the question answer matching the question text in the preset question answering library includes the following steps:
  • Step S1031 Use a preset word segmentation algorithm to split the question text into multiple words.
  • the preset word segmentation algorithm is the prior art.
  • the question sentence is split, the sentence is split into a combination of several words according to the composition of the word in the sentence, such as: The sentence is "what to eat today?".
  • splitting the question sentence will be split into a combination of "today”, "eat” and "what".
  • Step S1032 Search for sentences containing at least one of the words in the preset question and answer library.
  • the format of the sentences stored in the preset question and answer library is a question-and-answer form, that is, there is an answer to the corresponding question after each question.
  • the preset question and answer library is searched, and the words and The question in the preset question and answer library is searched for matching, for example, "What’s eating today?"
  • the split words are a combination of the three words "today”, "eat” and "what".
  • the three words are used as the search content to be matched with the questions in the preset question and answer library respectively.
  • Step S1033 Calculate the similarity between the sentence and the question text to obtain a similarity value.
  • the sentence in which the word is located is calculated for the similarity with the question text.
  • the calculation algorithm is the prior art and is not limited here. For example, if the question text is "What do you eat today?", after matching with the question in the preset question and answer library, one of the matched sentences is "What do you eat for lunch", then the word matching degree is used as the similarity calculation In the algorithm, there are two overlapping words between the question text and the matched sentence, namely "eat” and "what". The total word segmentation in the sentence is three words, so the similarity value is 67%.
  • Step S1034 Use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
  • Step S104 Determine whether the answer to the question contains a keyword that is preset to trigger a large-screen display, if it contains, execute step S105, if not, execute step S106.
  • the preset question and answer library is preset with keywords that trigger large-screen display, and the keywords displayed on the large-screen are associated with image names in the image library displayed on the large-screen.
  • Step S105 Send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries The keywords.
  • a large screen display instruction is sent to the screen display device, and the large screen
  • the display instruction carries the keywords displayed on the large screen, and the keywords displayed on the large screen are keywords preset in the preset question and answer database, for example, when the answer to the question is retrieved containing "iphone8"
  • a large-screen display instruction is generated, and the large-screen display instruction is sent to the screen display device.
  • the screen display device after the screen display device receives the large-screen display instruction, it searches for the image file containing the keyword in a preset display library according to the large-screen display instruction, and displays the searched Image file. Specifically, according to the keyword, the screen display device is called to process an image file in a preset large-screen display image library, the image file is named after the keyword, or is related to the keyword rule When the image file is retrieved, the image file is played on the screen display device.
  • the screen display device and the robot management platform are bridged through a back-end server.
  • the robot management platform will The keyword is sent to the back-end server.
  • the back-end server receives the keyword, it generates a large-screen display instruction, and sends the large-screen display instruction to the screen display device through a preset protocol.
  • the preset The protocol corresponds to the background server, for example, the background server is a message queue telemetry transmission (MQTT) server, and the preset protocol is a message queue telemetry transmission protocol.
  • MQTT message queue telemetry transmission
  • Step S106 Send the answer to the question to the robot.
  • the robot when the robot receives the answer to the question, it will broadcast the answer to the user in voice.
  • the conversion algorithm is the existing technology. No longer.
  • the conversion algorithm can be preset in the robot control unit, and the converted voice can be directly sent to the robot body, or when the robot body receives the answer to the question, the robot control unit controls the robot body according to The preset conversion algorithm is converted into speech, and the specific conversion method is not limited here.
  • the voice communication between the robot and the user is realized.
  • the video is interactive at the same time, so that the robot can show the dialogue content to the user in a more vivid and concrete way.
  • multiple robots and screen display devices can be configured, and a corresponding screen display device is assigned to each robot in advance.
  • the corresponding screen display device is controlled to display
  • the step S105 sending a large-screen display instruction to the screen display device includes the following steps:
  • Step S301 Obtain the identification number of the robot sending the question voice.
  • the robot is preset with an identity label, and when the robot receives the question voice sent by the user, the question voice is sent to the robot management platform by carrying the robot identity label.
  • Step S302 Send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.
  • the screen display device is pre-associated with the robot identity label, and when the answer to the question contains the keyword that triggers the large-screen display, the robot management platform will mark the identity label with the robot.
  • the associated screen display device sends a large-screen display instruction, so that the screen display device associated with the robot identity performs display according to the large-screen display instruction.
  • the identification numbers of the two robots are 1 and 2, respectively, the three screen display devices are marked as A, B, C, and the large-screen display devices A and B Subscribing to the question voice sent by robot 1, and large-screen display device C subscribes to the question voice sent by robot 2, and when the question voice received by robot 1 contains keywords that trigger the large-screen display, the robot management platform will carry The large-screen display instruction of the keyword is sent to the screen display devices A and B, but not to C.
  • the large-screen display device can set a display library according to the use of its associated robot.
  • the robot 1 is used for diagnosis, and the preset display library can be set to be related to the location of each department of the hospital. Video files.
  • Fig. 4 is a functional block diagram of a human-computer interaction device of the present application.
  • the device includes: an acquisition module 401, a conversion module 402, a matching module 403, a judgment module 404, and a sending module 405, wherein the The obtaining module 401 is used to obtain the user's question voice sent by the robot; the conversion module 402 is used to convert the question voice into question text; the matching module 403 is used to match the question text of the question text in a preset question and answer library Answer; judging module 404, for judging whether the answer to the question contains a keyword preset to trigger large-screen display; sending module 405, for when the answer to the question contains a keyword preset to trigger large-screen display, Send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries the key word.
  • the matching module 403 includes: a splitting unit 4031, a searching unit 4032, a calculation unit 4033, and a determining unit 4034.
  • the splitting unit 4031 is used to split the question text into multiple words using a preset word segmentation algorithm
  • the searching unit 4032 is used to search for sentences containing at least one of the words in the preset question and answer library;
  • the calculating unit 4033 is used to calculate the similarity between the sentences and the question text to obtain the similarity value;
  • the determining unit 4034 Used to use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
  • the sending module 405 includes a search unit 4051 and a display unit 4052.
  • the search unit 4051 is configured to search for an image file containing the keyword in a preset display library according to the large-screen display instruction;
  • the display unit 4052 is configured to display the searched image file.
  • the sending module further includes: an acquiring unit 4053 and a sending unit 4054, wherein the acquiring unit 4053 is configured to acquire and send the question The voice robot identity label; the sending unit 4054 is configured to send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.
  • the judgment module is used to judge whether the preset Q&A library contains a preset keyword that triggers large-screen display, and when the keyword is contained, the large-screen display instruction is sent to the screen display device through the sending module, so that The screen display device displays the content related to the keyword, so that the robot can display the video image vividly through the screen display device during the dialogue with the human; in addition, when there are multiple robots and large-screen display devices At the time, the robot’s identity label is obtained through the acquisition unit, and the large-screen display instruction is sent to the large-screen display device associated with the robot’s identity label through the sending module, thereby realizing the screen display contained in the question voice sent by the specific robot The video and audio files corresponding to the keywords are displayed on a specific large-screen display device to ensure the security of the content displayed on the screen.
  • the embodiments of the present application provide a non-volatile computer-readable storage medium, the computer-readable storage medium stores at least one executable instruction, and the computer-executable instruction can execute one of the above-mentioned method embodiments. Machine interaction method.
  • FIG. 5 is a schematic structural diagram of an embodiment of a computing device of this application, and the specific embodiment of this application does not limit the specific implementation of the computing device.
  • the computing device may include: a processor (processor) 502, a communication interface (Communications Interface) 504, a memory (memory) 506, and a communication bus 508.
  • processor processor
  • communication interface Communication Interface
  • memory memory
  • the processor 502, the communication interface 504, and the memory 506 communicate with each other through the communication bus 508.
  • the communication interface 504 is used to communicate with other devices.
  • the processor 502 is configured to execute the program 510, and specifically can execute the relevant steps in the foregoing embodiment of the human-computer interaction method.
  • the program 510 may include program code, and the program code includes computer operation instructions.
  • the processor 502 may be a central processing unit CPU, or an ASIC (Application Specific Integrated Circuit), or one or more integrated circuits configured to implement the embodiments of the present application.
  • the one or more processors included in the computing device may be the same type of processor, such as one or more CPUs, or different types of processors, such as one or more CPUs and one or more ASICs.
  • the memory 506 is used to store the program 510.
  • the memory 506 may include a high-speed RAM memory, and may also include a non-volatile memory (non-volatile memory), for example, at least one disk memory.
  • the program 510 may be specifically used to cause the processor 502 to perform the following operations:
  • the instruction carries the keyword.
  • program 510 may be further specifically used to cause the processor 502 to perform the following operations:
  • the program 510 may be further configured to cause the processor 502 to perform the following operations: according to the large-screen display instruction, search for an image file containing the keyword in a preset display library; The image file.
  • the program 510 may be further specifically configured to cause the processor 502 to perform the following operations: Obtain the robot that sent the question voice Identity label; sending a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.
  • modules or units or components in the embodiments may be combined into one module or unit or component, and in addition, they may be divided into a plurality of submodules or subunits or subcomponents. Except that at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings) and any method so disclosed may be adopted in any combination All processes or units of equipment are combined. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose.
  • Each component embodiment of the present application may be implemented by hardware, or by software modules running on one or more processors, or by a combination of them.
  • a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components in the human-computer interaction device according to the embodiments of the present application.
  • the application can also be implemented as a device or device program (for example, a computer program and a computer program product) for executing part or all of the methods described herein.
  • Such a program for implementing the present application may be stored on a computer-readable medium, or may have the form of one or more signals.
  • Such a signal can be downloaded from an Internet website, or provided on a carrier signal, or provided in any other form.

Abstract

The present application relates to the technical field of human-computer interaction, in particular, to a human-computer interaction method and apparatus, a computing device, and a computer-readable storage medium. The method comprises: obtaining a question speech of a user sent by a robot; converting the question speech into question text; matching a question answer of the question text in a preset question and answer bank; determining whether the question answer comprises a preset keyword for triggering large-screen display; and if yes, sending the question answer to the robot, and sending a large-screen display instruction to a screen display device, so that the screen display device performs display according to the large-screen display instruction, wherein the large-screen display instruction carries the keyword. Thus, the use of the solution of the present application can make a robot communicate with a person by means of screen display.

Description

人机交互方法、装置、计算设备及计算机可读存储介质Human-computer interaction method, device, computing equipment and computer readable storage medium 技术领域Technical field
本申请实施例涉及人机交互技术领域,特别是涉及一种人机交互方法、装置、计算设备及计算机可读存储介质。The embodiments of the present application relate to the field of human-computer interaction technology, and in particular, to a human-computer interaction method, device, computing device, and computer-readable storage medium.
背景技术Background technique
人机交互是人与计算机之间使用某种语言对话,以一定的交互方式,完成确定任务的人与计算机之间的信息交换过程。随着智能技术的发展,智能机器人的研究取得了突破性的进展,在家庭生活、医疗、工业等技术领域得到了广泛应用,人与机器人之间的交互方式也越来越多样化,如,文字交互、语音交互等,其中,语音交互是当前人与机器人交互的一个主要方式。Human-computer interaction is a process of information exchange between people and computers using a certain language to complete certain tasks in a certain interactive manner. With the development of intelligent technology, breakthroughs have been made in the research of intelligent robots, which have been widely used in technical fields such as family life, medical treatment, and industry, and the interaction between humans and robots has become more and more diversified, such as, Text interaction, voice interaction, etc., among them, voice interaction is currently a main way of interaction between humans and robots.
本申请的发明人在实现本申请的过程中,发现:现有机器人通过文字或语音与人沟通,无法通过屏幕展示与会话内容相关的影像。In the process of realizing this application, the inventor of the present application found that: existing robots communicate with people through text or voice, and cannot display images related to the conversation content through the screen.
申请内容Application content
鉴于上述问题,提出了本申请以便提供一种克服上述问题或者至少部分地解决上述问题的一种人机交互方法、装置、计算设备及计算机可读存储介质。In view of the above problems, this application is proposed to provide a human-computer interaction method, device, computing device, and computer-readable storage medium that overcome the above problems or at least partially solve the above problems.
为解决上述技术问题,本申请实施例采用的一个技术方案是:提供一种人机交互方法,包括获取机器人发送的用户的问题语音;将所述问题语音转换成问题文字;在预设问答库中匹配所述问题文字的问题答案;判断所述问题答案中是否包含预设触发大屏展示的关键字;若包含,向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。In order to solve the above technical problems, a technical solution adopted in the embodiments of this application is to provide a human-computer interaction method, including obtaining the user's question voice sent by the robot; converting the question voice into question text; The answer to the question that matches the text of the question in the question; judge whether the answer to the question contains a keyword that is preset to trigger a large-screen display; if it does, send the answer to the question to the robot and send a large-screen display to the screen display device Instructions to cause the screen display device to display according to the large-screen display instruction, wherein the large-screen display instruction carries the keyword.
可选的,所述在预设问答库中匹配所述问题文字的问题答案包括:使用预设分词算法将所述问题文字拆分为多个词语;在预设问答库中搜索包含至少一个所述词语的语句;计算所述语句和所述问题文字的相似度,得到相似度值;将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。Optionally, the question answer matching the question text in the preset question and answer library includes: using a preset word segmentation algorithm to split the question text into multiple words; searching in the preset question and answer library containing at least one question Calculate the similarity between the sentence and the question text to obtain the similarity value; use the answer corresponding to the sentence with the highest similarity value as the question answer of the question text.
可选的,所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字包括:根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件;显示搜索到所述影像文件。Optionally, the screen display device performs display according to the large-screen display instruction, wherein the carrying of the keyword in the large-screen display instruction includes: searching a preset display library according to the large-screen display instruction to include all The image file of the keyword; display the image file searched.
可选的,当所述机器人所述屏幕展示设备的数量均为多个时,所述向屏幕展示设备发送大屏展示指令,还包括:获取发送所述问题语音的机器人身份标号;向所述屏幕展示设备发送大屏展示指令,其中,所述屏幕展示设备与所述机器人身份标号相关联。Optionally, when the number of the screen display devices of the robot is multiple, the sending a large-screen display instruction to the screen display device may further include: obtaining the identification number of the robot that sends the question voice; The screen display device sends a large-screen display instruction, wherein the screen display device is associated with the robot identity label.
本申请实施例采用的另一个技术方案是:提供一种人机交互装置,包括:获取模块:用于获取机器人发送的用户的问题语音;转换模块:用于将所述问题语音转换成问题文字;匹配模块:用于在预设问答库中匹配所述问题文字的问题答案;判断模块:用于判断所述问题答案中是否包含预设触发大屏展示的关键字;发送模块:用于当所述问题答案中包含预设触发大屏展示的关键字时,向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。Another technical solution adopted in the embodiments of the present application is to provide a human-computer interaction device, including: an acquisition module: used to acquire the user’s question voice sent by the robot; and a conversion module: used to convert the question voice into question text ; Matching module: used to match the answer to the question text in the preset question and answer library; Judging module: used to judge whether the answer to the question contains a preset keyword that triggers large-screen display; Sending module: Used for When the answer to the question contains a keyword that is preset to trigger a large-screen display, the answer to the question is sent to the robot, and a large-screen display instruction is sent to the screen display device, so that the screen display device responds to the large-screen display The display instruction is displayed, wherein the large-screen display instruction carries the keyword.
可选的,所述匹配模块包括:拆分单元:用于使用预设分词算法将所述问题文字拆分为多个词语;搜索单元:用于在预设问答库中搜索包含至少一个所述词语的语句;计算单元:用于计算所述语句和所述问题文字的相似度,得到相似度值;确定单元:用于将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。Optionally, the matching module includes: a splitting unit: used to split the question text into multiple words using a preset word segmentation algorithm; a search unit: used to search a preset question and answer library that contains at least one of the The sentence of the word; calculation unit: used to calculate the similarity between the sentence and the question text to obtain the similarity value; the determination unit: used to take the answer corresponding to the sentence with the highest similarity value as the question text Answer.
可选的,所述发送模块包括:搜索单元:用于根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件;显示单元:用于显示搜索到所述影像文件。Optionally, the sending module includes: a search unit: used to search for an image file containing the keyword in a preset display library according to the large-screen display instruction; a display unit: used to display the searched image file .
可选的,当所述机器人与所述屏幕展示设备的数量均为多个时,所述发送模块还包括:Optionally, when the numbers of the robot and the screen display device are both multiple, the sending module further includes:
获取单元:用于获取发送所述问题语音的机器人身份标号;发送单元:用于向所述屏幕展示设备发送大屏展示指令,其中,所述屏幕展示设备与所述机器人身份标号相关联。Obtaining unit: used to obtain the identity label of the robot that sent the question voice; sending unit: used to send a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.
本申请实施例采用的再一技术方案是:提供一种计算设备,包括:处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行如一种人机交互方法对应的操作。Another technical solution adopted in the embodiments of the present application is to provide a computing device, including: a processor, a memory, a communication interface, and a communication bus. The processor, the memory, and the communication interface are completed through the communication bus. Mutual communication; the memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to a human-computer interaction method.
本申请实施例采用的又一技术方案是,提供一种计算机可读存储介质,所 述存储介质中存储有至少一可执行指令,所述可执行指令使处理器执行如一种人机交互方法对应的操作。Another technical solution adopted in the embodiments of the present application is to provide a computer-readable storage medium in which at least one executable instruction is stored, and the executable instruction causes a processor to execute a corresponding human-computer interaction method. Operation.
本申请实施例的有益效果是:区别于现有技术的情况,本申请实施例通过在预设问答库中预设触发大屏展示的关键字当机器人在与人语音对话的过程中,问题答案包含该关键词时,触发屏幕展示所述关键字相关的内容,从而实现除了机器人与人的交互除了语音交互之外,还可以通过视频形象生动的展示相关的内容;此外,当存在多台机器人及大屏展示设备时,通过将大屏展示设备与机器人的身份标号相关联,从而实现了特定的机器人发送的问题语音包含的屏幕展示关键字对应的影音文件在特定的大屏展示设备上展示,保证了屏幕展示内容的安全性。The beneficial effect of the embodiment of the present application is: different from the prior art, the embodiment of the present application presets keywords that trigger the large-screen display in the preset question and answer library. When the robot is in the process of voice dialogue with the human, the answer to the question is When the keyword is included, the screen is triggered to display the content related to the keyword, so that in addition to the voice interaction between the robot and the human, the related content can be vividly displayed through the video image; in addition, when there are multiple robots And large-screen display equipment, by associating the large-screen display device with the robot’s identity label, it is possible to display the video and audio files corresponding to the screen display keywords contained in the question voice sent by the specific robot on the specific large-screen display device , To ensure the security of the content displayed on the screen.
上述说明仅是本申请技术方案的概述,为了能够更清楚了解本申请的技术手段,而可依照说明书的内容予以实施,并且为了让本申请的上述和其它目的、特征和优点能够更明显易懂,以下特举本申请的具体实施例。The above description is only an overview of the technical solutions of this application. In order to understand the technical means of this application more clearly, it can be implemented in accordance with the content of the specification, and in order to make the above and other objectives, features and advantages of this application more obvious and understandable. In the following, specific examples of the application are cited.
附图说明BRIEF DESCRIPTION
通过阅读下文优选实施例的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施例的目的,而并不认为是对本申请的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:By reading the detailed description of the preferred embodiments below, various other advantages and benefits will become clear to those of ordinary skill in the art. The drawings are only used for the purpose of illustrating preferred embodiments, and are not considered as a limitation to the application. Furthermore, throughout the drawings, the same reference symbols are used to denote the same components. In the drawings:
图1是本申请实施例的一种人机交互方法流程图;FIG. 1 is a flowchart of a human-computer interaction method according to an embodiment of the present application;
图2是本申请实施例的一种人机交互方法中匹配问题文字的问题答案的流程图;2 is a flowchart of a question answer matching question text in a human-computer interaction method according to an embodiment of the present application;
图3是本申请另一实施例的一种人机交互方法中向屏幕展示设备发送大屏展示指令流程图;3 is a flowchart of sending a large-screen display instruction to a screen display device in a human-computer interaction method according to another embodiment of the present application;
图4是本申请实施例的一种人机交互装置的功能框图;4 is a functional block diagram of a human-computer interaction device according to an embodiment of the present application;
图5是本申请实施例的一种计算设备的示意图。Fig. 5 is a schematic diagram of a computing device according to an embodiment of the present application.
具体实施方式detailed description
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本 公开,并且能够将本公开的范围完整的传达给本领域的技术人员。图1为本申请一种人机交互方法实施例的流程图。如图1所示,该方法包括以下步骤:Hereinafter, exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. Although the exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art. Fig. 1 is a flowchart of an embodiment of a human-computer interaction method according to this application. As shown in Figure 1, the method includes the following steps:
步骤S101:获取机器人发送的用户的问题语音。Step S101: Obtain the user's question voice sent by the robot.
在本步骤中,机器人使用者与所述机器人对话,所述对话方式可以是移动终端APP,或是机器人本体。所述机器人本体内部设置有机器人控制单元,所述机器人控制单元可以接收用户发送的问题语音,在此所述的机器人包括机器人本体及机器人控制单元。In this step, the robot user has a dialogue with the robot, and the dialogue mode can be a mobile terminal APP or a robot body. A robot control unit is arranged inside the robot body, and the robot control unit can receive question voices sent by a user. The robot described here includes a robot body and a robot control unit.
步骤S102:将所述问题语音转换成问题文字。Step S102: Convert the question voice into question text.
在本步骤中,当所述机器人控制单元接收到所述问题语音,使用预设语音传输通道传输给机器人管理平台,所述预设语音传输通道是预设的专门用来传输语音的通道,如,zeroc ice通道。所述机器人管理平台接收到所述问题语音后,使用预设的语音转化算法将所述问题语音转化成问题文字,所述预设语音转换算法是现有技术,如,科大讯飞研发的语音转化算法。In this step, when the robot control unit receives the question voice, it uses a preset voice transmission channel to transmit it to the robot management platform. The preset voice transmission channel is a preset channel dedicated to voice transmission, such as , Zeroc ice channel. After receiving the question voice, the robot management platform uses a preset voice conversion algorithm to convert the question voice into question text. The preset voice conversion algorithm is an existing technology, such as the voice developed by iFlytek Conversion algorithm.
步骤S103:在预设问答库中匹配所述问题文字的问题答案。Step S103: Match the question answer of the question text in the preset question answering library.
在本步骤中,将所述问题文字与预设问答库匹配,得到问题答案。如图2所示,所述在预设问答库中匹配所述问题文字的问题答案包括以下步骤:In this step, the question text is matched with the preset question and answer library to obtain the answer to the question. As shown in Figure 2, the question answer matching the question text in the preset question answering library includes the following steps:
步骤S1031:使用预设分词算法将所述问题文字拆分为多个词语。Step S1031: Use a preset word segmentation algorithm to split the question text into multiple words.
在本步骤中,所述预设分词算法是现有技术,在进行问题语句拆分时,将所述语句按照词语在句子中的成分拆分成几个词语的组合,如:当所述问题语句为“今天吃什么?”,在进行拆分时,所述问题语句会拆分成“今天”、“吃”及“什么”三个词语的组合。In this step, the preset word segmentation algorithm is the prior art. When the question sentence is split, the sentence is split into a combination of several words according to the composition of the word in the sentence, such as: The sentence is "what to eat today?". When splitting, the question sentence will be split into a combination of "today", "eat" and "what".
步骤S1032:在预设问答库中搜索包含至少一个所述词语的语句。Step S1032: Search for sentences containing at least one of the words in the preset question and answer library.
在本步骤,所述预设问答库中存储语句的形式是一问一答的形式,即每一个问题后面有相应问题的答案,所述在预设问答库中搜索,具体将所述词语与所述预设问答库中的问题进行匹配搜索,如,“今天吃什么?”拆分后的词语为“今天”、“吃”及“什么”三个词语的组合,在进行搜索时,将三个词语作为查找内容,分别与所述预设问答库中的问题进行匹配。In this step, the format of the sentences stored in the preset question and answer library is a question-and-answer form, that is, there is an answer to the corresponding question after each question. The preset question and answer library is searched, and the words and The question in the preset question and answer library is searched for matching, for example, "What’s eating today?" The split words are a combination of the three words "today", "eat" and "what". The three words are used as the search content to be matched with the questions in the preset question and answer library respectively.
步骤S1033:计算所述语句和所述问题文字的相似度,得到相似度值。Step S1033: Calculate the similarity between the sentence and the question text to obtain a similarity value.
在本步骤中,当在所述预设问答库中匹配到所述词语之后,将所述词语所在的语句与所述问题文字计算相似度,计算算法为现有技术,在此不做限定。 如,所述问题文字为“今天吃什么?”,通过与所述预设问答库中的问题匹配后,匹配到的其中一条语句为“午饭吃什么”,则通过词语匹配度作为相似度计算算法时,所述问题文字与所述匹配到的语句之间共重合两个词,即“吃”和“什么”,所述语句中的分词总量为三个词,因此,相似度值为67%。In this step, after the word is matched in the preset question and answer library, the sentence in which the word is located is calculated for the similarity with the question text. The calculation algorithm is the prior art and is not limited here. For example, if the question text is "What do you eat today?", after matching with the question in the preset question and answer library, one of the matched sentences is "What do you eat for lunch", then the word matching degree is used as the similarity calculation In the algorithm, there are two overlapping words between the question text and the matched sentence, namely "eat" and "what". The total word segmentation in the sentence is three words, so the similarity value is 67%.
步骤S1034:将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。Step S1034: Use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
在本步骤中,将所述预设问答库中检索到的每一条语句与所述问题文字计算相似度,并认为所述相似度最高的一条语句与所述问题文字最接近,可以理解的是,当用户对所述机器人发送一条问题语句后,希望得到是所述输入问题语句对应的答复,因此,将所述预设问答库中相似度最高的语句对应的答复发送作为所述问题文字的问题答案。In this step, calculate the similarity between each sentence retrieved in the preset question answering database and the question text, and consider that the sentence with the highest similarity is the closest to the question text, which is understandable After the user sends a question sentence to the robot, he hopes to get the answer corresponding to the input question sentence. Therefore, the answer corresponding to the sentence with the highest similarity in the preset question and answer library is sent as the question text Answer.
步骤S104:判断所述问题答案中是否包含预设触发大屏展示的关键字,若包含,执行步骤S105,若不包含,执行步骤S106。Step S104: Determine whether the answer to the question contains a keyword that is preset to trigger a large-screen display, if it contains, execute step S105, if not, execute step S106.
在本步骤中,所述预设问答库中预设有触发大屏展示的关键字,所述大屏展示的关键字与所述大屏展示的影像库中的影像名相关联。In this step, the preset question and answer library is preset with keywords that trigger large-screen display, and the keywords displayed on the large-screen are associated with image names in the image library displayed on the large-screen.
步骤S105:向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。Step S105: Send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries The keywords.
在本步骤中,当所述问题答案中包含预设大屏展示的关键字时,向所述机器人发送所述问题答案的同时,向所述屏幕展示设备发送大屏展示指令,所述大屏展示指令中携带所述大屏展示的关键字,所述大屏展示的关键字是预设在所述预设问答库中的关键词,如,当检索到所述问题答案中含有“iphone8”一词时,会生成一个大屏展示指令,并将所述大屏展示指令发送给屏幕展示设备。In this step, when the answer to the question contains a keyword displayed on a preset large screen, while sending the answer to the question to the robot, a large screen display instruction is sent to the screen display device, and the large screen The display instruction carries the keywords displayed on the large screen, and the keywords displayed on the large screen are keywords preset in the preset question and answer database, for example, when the answer to the question is retrieved containing "iphone8" When one word is used, a large-screen display instruction is generated, and the large-screen display instruction is sent to the screen display device.
需要说明的是,当所述屏幕展示设备接收到所述大屏展示指令后,根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件,并显示搜索到所述影像文件。具体的,根据所述关键字调用所述屏幕展示设备的处理其中预设的大屏展示的影像库中的影像文件,所述影像文件以所述关键字命名,或与所述关键字正则相关,当检索到所述影像文件时,在所述屏幕展示设备上播放所述影像文件。It should be noted that, after the screen display device receives the large-screen display instruction, it searches for the image file containing the keyword in a preset display library according to the large-screen display instruction, and displays the searched Image file. Specifically, according to the keyword, the screen display device is called to process an image file in a preset large-screen display image library, the image file is named after the keyword, or is related to the keyword rule When the image file is retrieved, the image file is played on the screen display device.
应理解,当使用所述关键字检索所述影像文件时,与所述关键字相关的影 像文件可能不止一个,此时,计算所述关键字与所述影像文件的文件名的相似度,并播放所述相似度最高的影像文件名对应的影像文件,具体计算相似度方式可以参照在预设问答库中匹配问题文字的问题答案时所述的操作,在此不再赘述。It should be understood that when using the keyword to retrieve the image file, there may be more than one image file related to the keyword. In this case, calculate the similarity between the keyword and the file name of the image file, and The image file corresponding to the image file name with the highest similarity is played, and the specific method for calculating the similarity can refer to the operation described when matching the question answer of the question text in the preset question and answer library, which will not be repeated here.
在一个实施方式中,所述屏幕展示设备与所述机器人管理平台通过一个后台服务器桥接,当所述问题答案中包含所述预设触发大屏展示的关键字时,所述机器人管理平台将所述关键字发送给所述后台服务器,所述后台服务器接收到所述关键字时,生成大屏展示指令,并将所述大屏展示指令通过预设协议发送给屏幕展示设备,所述预设协议与所述后台服务器相对应,如,后台服务器为消息队列遥测传输(MQTT)服务器,所述预设协议为消息队列遥测传输协议。In one embodiment, the screen display device and the robot management platform are bridged through a back-end server. When the answer to the question contains the preset keyword that triggers large-screen display, the robot management platform will The keyword is sent to the back-end server. When the back-end server receives the keyword, it generates a large-screen display instruction, and sends the large-screen display instruction to the screen display device through a preset protocol. The preset The protocol corresponds to the background server, for example, the background server is a message queue telemetry transmission (MQTT) server, and the preset protocol is a message queue telemetry transmission protocol.
步骤S106:向所述机器人发送问题答案。Step S106: Send the answer to the question to the robot.
在本步骤中,当所述问题答案中不包含所述预设触发大屏展示的关键字时,说明检索到的问题答案不需要使用屏幕展示设备展示,此时,将所述问题答案发送给机器人。In this step, when the answer to the question does not include the preset keyword that triggers the large-screen display, it means that the answer to the question does not need to be displayed on the screen display device. At this time, the answer to the question is sent to robot.
应理解,当机器人接收到所述问题答案时,会将所述问题答案以语音方式播放给用户,在此过程中,涉及到将文字转化成语音的过程,转化算法为现有技术,在此不再赘述。在此过程中,可以在机器人控制单元中预设转换算法,直接将转化之后的语音发送给机器人本体,也可以当机器人本体接收到所述问题答案后,由机器人控制单元控制所述机器人本体按照预设的转换算法转换成语音,具体转换方式在此不做限定。It should be understood that when the robot receives the answer to the question, it will broadcast the answer to the user in voice. In this process, the process of converting text into speech is involved. The conversion algorithm is the existing technology. No longer. In this process, the conversion algorithm can be preset in the robot control unit, and the converted voice can be directly sent to the robot body, or when the robot body receives the answer to the question, the robot control unit controls the robot body according to The preset conversion algorithm is converted into speech, and the specific conversion method is not limited here.
在本申请实施例中,通过在预设问答库中匹配问题文字的问题答案,并在所述预设问答库中预设触发大屏展示的关键字,从而实现机器人与用户之间以语音及视频同时互动,使机器人以更加形象具体的方式向用户展示对话内容。In the embodiment of the present application, by matching the answer of the question text in the preset question and answer library, and preset the keywords that trigger the large-screen display in the preset question and answer library, the voice communication between the robot and the user is realized. The video is interactive at the same time, so that the robot can show the dialogue content to the user in a more vivid and concrete way.
在一些实施例中,机器人与屏幕展示设备均可配置多个,并且预先为每个机器人分配好对应的屏幕展示设备,当一机器人需要屏幕展示设备协助展示时,控制对应的屏幕展示设备进行显示,如图3所示,步骤S105中所述向屏幕展示设备发送大屏展示指令包括以下步骤:In some embodiments, multiple robots and screen display devices can be configured, and a corresponding screen display device is assigned to each robot in advance. When a robot needs a screen display device to assist in the display, the corresponding screen display device is controlled to display As shown in Figure 3, the step S105 sending a large-screen display instruction to the screen display device includes the following steps:
步骤S301:获取发送所述问题语音的机器人身份标号。Step S301: Obtain the identification number of the robot sending the question voice.
在本步骤中,所述机器人预先设置有身份标号,当所述机器人接收到用户发送的问题语音后,将所述问题语音以携带机器人身份标号的方式发送给机器 人管理平台。In this step, the robot is preset with an identity label, and when the robot receives the question voice sent by the user, the question voice is sent to the robot management platform by carrying the robot identity label.
步骤S302:向所述屏幕展示设备发送大屏展示指令,其中,所述屏幕展示设备与所述机器人身份标号相关联。Step S302: Send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.
在本步骤中,所述屏幕展示设备预先与所述机器人身份标号相关联,当所述问题答案中含有所述触发大屏展示的关键字时,机器人管理平台向所述与所述机器人身份标号相关联的屏幕展示设备发送大屏展示指令,以使与所述机器人身份相关联的屏幕展示设备根据所述大屏展示指令进行展示。如,有两台机器人及三台屏幕展示设备,两台机器人的身份标号分别为1和2,所述三台屏幕展示设备分别记为A、B、C,所述大屏展示设备A和B订阅了机器人1发送的问题语音,大屏展示设备C订阅了机器人2发送的问题语音,则当机器人1接收到的问题语音中含有触发大屏展示的关键字时,所述机器人管理平台将携带所述关键字的大屏展示指令发送给屏幕展示设备A和B,而不会发送给C。In this step, the screen display device is pre-associated with the robot identity label, and when the answer to the question contains the keyword that triggers the large-screen display, the robot management platform will mark the identity label with the robot. The associated screen display device sends a large-screen display instruction, so that the screen display device associated with the robot identity performs display according to the large-screen display instruction. For example, there are two robots and three screen display devices, the identification numbers of the two robots are 1 and 2, respectively, the three screen display devices are marked as A, B, C, and the large-screen display devices A and B Subscribing to the question voice sent by robot 1, and large-screen display device C subscribes to the question voice sent by robot 2, and when the question voice received by robot 1 contains keywords that trigger the large-screen display, the robot management platform will carry The large-screen display instruction of the keyword is sent to the screen display devices A and B, but not to C.
应理解的是:所述大屏展示设备可以根据其关联的机器人的用途设置展示库,如,在医院中,机器人1用于导诊,则预设展示库可以设置与医院各个部门位置相关的视频文件。It should be understood that: the large-screen display device can set a display library according to the use of its associated robot. For example, in a hospital, the robot 1 is used for diagnosis, and the preset display library can be set to be related to the location of each department of the hospital. Video files.
在本实施例中,通过将屏幕展示设备与机器人身份标号相关联,从而实现了特定的机器人发送的问题语音中包含的大屏展示关键字对应的影音在特定的大屏展示设备上播放,保证了屏幕展示内容的安全性及屏幕展示的高效性。In this embodiment, by associating the screen display device with the robot identity label, it is realized that the video and audio corresponding to the large-screen display keywords contained in the question voice sent by the specific robot are played on the specific large-screen display device, ensuring that The security of the content displayed on the screen and the efficiency of the screen display are improved.
图4是本申请一种人机交互装置的功能框图,如图4所示,所述装置包括:获取模块401、转换模块402、匹配模块403、判断模块404及发送模块405,其中,所述获取模块401,用于获取机器人发送的用户的问题语音;转换模块402,用于将所述问题语音转换成问题文字;匹配模块403,用于在预设问答库中匹配所述问题文字的问题答案;判断模块404,用于判断所述问题答案中是否包含预设触发大屏展示的关键字;发送模块405,用于当所述问题答案中包含预设触发大屏展示的关键字时,向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。Fig. 4 is a functional block diagram of a human-computer interaction device of the present application. As shown in Fig. 4, the device includes: an acquisition module 401, a conversion module 402, a matching module 403, a judgment module 404, and a sending module 405, wherein the The obtaining module 401 is used to obtain the user's question voice sent by the robot; the conversion module 402 is used to convert the question voice into question text; the matching module 403 is used to match the question text of the question text in a preset question and answer library Answer; judging module 404, for judging whether the answer to the question contains a keyword preset to trigger large-screen display; sending module 405, for when the answer to the question contains a keyword preset to trigger large-screen display, Send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries the key word.
其中,所述匹配模块403包括:拆分单元4031、搜索单元4032、计算单元4033及确定单元4034,其中拆分单元4031,用于使用预设分词算法将所述问题文字拆分为多个词语;搜索单元4032,用于在预设问答库中搜索包含至少一个 所述词语的语句;计算单元4033,用于计算所述语句和所述问题文字的相似度,得到相似度值;确定单元4034,用于将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。Wherein, the matching module 403 includes: a splitting unit 4031, a searching unit 4032, a calculation unit 4033, and a determining unit 4034. The splitting unit 4031 is used to split the question text into multiple words using a preset word segmentation algorithm The searching unit 4032 is used to search for sentences containing at least one of the words in the preset question and answer library; the calculating unit 4033 is used to calculate the similarity between the sentences and the question text to obtain the similarity value; the determining unit 4034 , Used to use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
其中,所述发送模块405包括:搜索单元4051及显示单元4052,其中,所述搜索单元4051,用于根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件;所述显示单元4052,用于显示搜索到所述影像文件。Wherein, the sending module 405 includes a search unit 4051 and a display unit 4052. The search unit 4051 is configured to search for an image file containing the keyword in a preset display library according to the large-screen display instruction; The display unit 4052 is configured to display the searched image file.
其中,当所述机器人与所述屏幕展示设备的数量均为多个时,所述发送模块还包括:获取单元4053及发送单元4054,其中,所述获取单元4053,用于获取发送所述问题语音的机器人身份标号;所述发送单元4054,用于向所述屏幕展示设备发送大屏展示指令,其中,所述屏幕展示设备与所述机器人身份标号相关联。Wherein, when the number of the robot and the screen display device are both multiple, the sending module further includes: an acquiring unit 4053 and a sending unit 4054, wherein the acquiring unit 4053 is configured to acquire and send the question The voice robot identity label; the sending unit 4054 is configured to send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.
本申请实施例中,通过判断模块判断预设问答库中是否包含预设触发大屏展示的关键字,当包含所述关键字时,通过发送模块向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备展示所述关键字相关的内容,从而实现了机器人在与人对话的过程中,可以通过屏幕展示设备进行视频形象生动的展示;此外,当存在多台机器人及大屏展示设备时,通过获取单元获取机器人的身份标号,并通过发送模块向与所述机器人的身份标号相关联的大屏展示设备发送大屏展示指令,从而实现了特定的机器人发送的问题语音包含的屏幕展示关键字对应的影音文件在特定的大屏展示设备上展示,保证了屏幕展示内容的安全性。In the embodiment of the present application, the judgment module is used to judge whether the preset Q&A library contains a preset keyword that triggers large-screen display, and when the keyword is contained, the large-screen display instruction is sent to the screen display device through the sending module, so that The screen display device displays the content related to the keyword, so that the robot can display the video image vividly through the screen display device during the dialogue with the human; in addition, when there are multiple robots and large-screen display devices At the time, the robot’s identity label is obtained through the acquisition unit, and the large-screen display instruction is sent to the large-screen display device associated with the robot’s identity label through the sending module, thereby realizing the screen display contained in the question voice sent by the specific robot The video and audio files corresponding to the keywords are displayed on a specific large-screen display device to ensure the security of the content displayed on the screen.
本申请实施例提供了一种非易失性计算机可读存储介质,所述计算机可读存储介质存储有至少一可执行指令,该计算机可执行指令可执行上述任意方法实施例中的一种人机交互方法。The embodiments of the present application provide a non-volatile computer-readable storage medium, the computer-readable storage medium stores at least one executable instruction, and the computer-executable instruction can execute one of the above-mentioned method embodiments. Machine interaction method.
图5为本申请计算设备实施例的结构示意图,本申请具体实施例并不对计算设备的具体实现做限定。FIG. 5 is a schematic structural diagram of an embodiment of a computing device of this application, and the specific embodiment of this application does not limit the specific implementation of the computing device.
如图5所示,该计算设备可以包括:处理器(processor)502、通信接口(Communications Interface)504、存储器(memory)506、以及通信总线508。As shown in FIG. 5, the computing device may include: a processor (processor) 502, a communication interface (Communications Interface) 504, a memory (memory) 506, and a communication bus 508.
其中:among them:
处理器502、通信接口504、以及存储器506通过通信总线508完成相互间的通信。The processor 502, the communication interface 504, and the memory 506 communicate with each other through the communication bus 508.
通信接口504,用于与其它设备通信。The communication interface 504 is used to communicate with other devices.
处理器502,用于执行程序510,具体可以执行上述一种人机交互方法实施例中的相关步骤。The processor 502 is configured to execute the program 510, and specifically can execute the relevant steps in the foregoing embodiment of the human-computer interaction method.
具体地,程序510可以包括程序代码,该程序代码包括计算机操作指令。Specifically, the program 510 may include program code, and the program code includes computer operation instructions.
处理器502可能是中央处理器CPU,或者是特定集成电路ASIC(Application Specific Integrated Circuit),或者是被配置成实施本申请实施例的一个或多个集成电路。计算设备包括的一个或多个处理器,可以是同一类型的处理器,如一个或多个CPU;也可以是不同类型的处理器,如一个或多个CPU以及一个或多个ASIC。The processor 502 may be a central processing unit CPU, or an ASIC (Application Specific Integrated Circuit), or one or more integrated circuits configured to implement the embodiments of the present application. The one or more processors included in the computing device may be the same type of processor, such as one or more CPUs, or different types of processors, such as one or more CPUs and one or more ASICs.
存储器506,用于存放程序510。存储器506可能包含高速RAM存储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。The memory 506 is used to store the program 510. The memory 506 may include a high-speed RAM memory, and may also include a non-volatile memory (non-volatile memory), for example, at least one disk memory.
程序510具体可以用于使得处理器502执行以下操作:The program 510 may be specifically used to cause the processor 502 to perform the following operations:
获取机器人发送的用户的问题语音;将所述问题语音转换成问题文字;在预设问答库中匹配所述问题文字的问题答案;判断所述问题答案中是否包含预设触发大屏展示的关键字;若包含,向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。Obtain the user’s question voice sent by the robot; convert the question voice into question text; match the question answer of the question text in the preset question and answer library; determine whether the question answer contains the key to trigger the large-screen display by default If it contains, send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display The instruction carries the keyword.
在一种可选的方式中,程序510具体可以进一步用于使得处理器502执行以下操作:In an optional manner, the program 510 may be further specifically used to cause the processor 502 to perform the following operations:
使用预设分词算法将所述问题文字拆分为多个词语;在预设问答库中搜索包含至少一个所述词语的语句;计算所述语句和所述问题文字的相似度,得到相似度值;将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。Use a preset word segmentation algorithm to split the question text into multiple words; search a preset question and answer library for sentences containing at least one of the words; calculate the similarity between the sentence and the question text to obtain a similarity value ; Use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
在一种可选的方式中,程序510具体可以进一步用于使得处理器502执行以下操作:根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件;显示搜索到所述影像文件。In an optional manner, the program 510 may be further configured to cause the processor 502 to perform the following operations: according to the large-screen display instruction, search for an image file containing the keyword in a preset display library; The image file.
在一种可选的方式中,当所述机器人与所述屏幕展示设备的数量均为多个时,程序510具体可以进一步用于使得处理器502执行以下操作:获取发送所述问题语音的机器人身份标号;向所述屏幕展示设备发送大屏展示指令,其中, 所述屏幕展示设备与所述机器人身份标号相关联。In an optional manner, when the numbers of the robot and the screen display device are both multiple, the program 510 may be further specifically configured to cause the processor 502 to perform the following operations: Obtain the robot that sent the question voice Identity label; sending a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.
在此提供的算法和显示不与任何特定计算机、虚拟系统或者其它设备固有相关。各种通用系统也可以与基于在此的示教一起使用。根据上面的描述,构造这类系统所要求的结构是显而易见的。此外,本申请也不针对任何特定编程语言。应当明白,可以利用各种编程语言实现在此描述的本申请的内容,并且上面对特定语言所做的描述是为了披露本申请的最佳实施例。The algorithms and displays provided here are not inherently related to any particular computer, virtual system or other equipment. Various general-purpose systems can also be used with the teaching based on this. From the above description, the structure required to construct this type of system is obvious. In addition, this application is not aimed at any specific programming language. It should be understood that various programming languages can be used to implement the content of the application described herein, and the above description of a specific language is for disclosing the best embodiment of the application.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本申请的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。The specification provided here explains a lot of specific details. However, it can be understood that the embodiments of the present application can be practiced without these specific details. In some instances, well-known methods, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本申请的示例性实施例的描述中,本申请的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本申请要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施例的权利要求书由此明确地并入该具体实施例,其中每个权利要求本身都作为本申请的单独实施例。Similarly, it should be understood that in order to simplify the present disclosure and help understand one or more of the various inventive aspects, in the above description of the exemplary embodiments of the present application, the various features of the present application are sometimes grouped together into a single embodiment, Figure, or its description. However, the disclosed method should not be construed to reflect the intention that the claimed application requires more features than the features explicitly recorded in each claim. More precisely, as reflected in the claims, the inventive aspect lies in less than all the features of a single embodiment disclosed previously. Therefore, the claims following the specific embodiment are thus explicitly incorporated into the specific embodiment, wherein each claim itself serves as a separate embodiment of the application.
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。Those skilled in the art can understand that it is possible to adaptively change the modules in the device in the embodiment and set them in one or more devices different from the embodiment. The modules or units or components in the embodiments may be combined into one module or unit or component, and in addition, they may be divided into a plurality of submodules or subunits or subcomponents. Except that at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings) and any method so disclosed may be adopted in any combination All processes or units of equipment are combined. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose.
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组合意味着处于本申请的范围之内并且形成不同的实施例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。In addition, those skilled in the art can understand that although some embodiments described herein include certain features included in other embodiments but not other features, the combination of features of different embodiments means that they are within the scope of the present application. Within and form different embodiments. For example, in the following claims, any one of the claimed embodiments can be used in any combination.
本申请的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本申请实施例的一种人机交互装置中的一些或者全部部件的一些或者全部功能。本申请还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本申请的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。Each component embodiment of the present application may be implemented by hardware, or by software modules running on one or more processors, or by a combination of them. Those skilled in the art should understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components in the human-computer interaction device according to the embodiments of the present application. The application can also be implemented as a device or device program (for example, a computer program and a computer program product) for executing part or all of the methods described herein. Such a program for implementing the present application may be stored on a computer-readable medium, or may have the form of one or more signals. Such a signal can be downloaded from an Internet website, or provided on a carrier signal, or provided in any other form.
应该注意的是上述实施例对本申请进行说明而不是对本申请进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本申请可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It should be noted that the above-mentioned embodiments illustrate rather than limit the application, and those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs between parentheses should not be constructed as limitations on the claims. The word "comprising" does not exclude the presence of elements or steps not listed in the claims. The word "a" or "an" preceding an element does not exclude the presence of multiple such elements. The application can be implemented by means of hardware including several different elements and by means of a suitably programmed computer. In the unit claims enumerating several devices, several of these devices may be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.

Claims (10)

  1. 一种人机交互方法,其特征在于,包括:A human-computer interaction method, characterized in that it comprises:
    获取机器人发送的用户的问题语音;Get the user's question voice sent by the robot;
    将所述问题语音转换成问题文字;Convert the question voice into question text;
    在预设问答库中匹配所述问题文字的问题答案;The answer to the question matching the question text in the preset question and answer library;
    判断所述问题答案中是否包含预设触发大屏展示的关键字;Determine whether the answer to the question contains keywords that preset to trigger large-screen display;
    若包含,向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。If it does, send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries The keywords.
  2. 根据权利要求1所述的方法,其特征在于,所述在预设问答库中匹配所述问题文字的问题答案包括:The method according to claim 1, wherein the question answer matching the question text in the preset question answering library comprises:
    使用预设分词算法将所述问题文字拆分为多个词语;Use a preset word segmentation algorithm to split the question text into multiple words;
    在预设问答库中搜索包含至少一个所述词语的语句;Searching for sentences containing at least one of the words in the preset question and answer library;
    计算所述语句和所述问题文字的相似度,得到相似度值;Calculate the similarity between the sentence and the question text to obtain the similarity value;
    将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。The answer corresponding to the sentence with the highest similarity value is used as the answer to the question text.
  3. 根据权利要求1所述的方法,其特征在于,所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字包括:The method according to claim 1, wherein the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carrying the keyword includes:
    根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件;Searching for an image file containing the keyword in a preset display library according to the large-screen display instruction;
    显示搜索到所述影像文件。Display the searched image file.
  4. 根据权利要求1所述的方法,其特征在于,当所述机器人与所述屏幕展示设备的数量均为多个时,所述向屏幕展示设备发送大屏展示指令,还包括:The method according to claim 1, wherein when the number of the robot and the screen display device are both multiple, the sending a large-screen display instruction to the screen display device further comprises:
    获取发送所述问题语音的机器人身份标号;Acquiring the identification number of the robot that sent the question voice;
    向所述屏幕展示设备发送大屏展示指令,其中,所述屏幕展示设备与所述机器人身份标号相关联。Send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.
  5. 一种人机交互装置,其特征在于,包括:A human-computer interaction device, characterized in that it comprises:
    获取模块:用于获取机器人发送的用户的问题语音;Acquisition module: used to acquire the user's question voice sent by the robot;
    转换模块:用于将所述问题语音转换成问题文字;Conversion module: used to convert the question voice into question text;
    匹配模块:用于在预设问答库中匹配所述问题文字的问题答案;Matching module: used to match the question answer of the question text in the preset question and answer library;
    判断模块:用于判断所述问题答案中是否包含预设触发大屏展示的关键字;Judgment module: used to judge whether the answer to the question contains a keyword preset to trigger a large-screen display;
    发送模块:用于当所述问题答案中包含预设触发大屏展示的关键字时,向所述机器人发送所述问题答案,并且向屏幕展示设备发送大屏展示指令,以使所述屏幕展示设备根据所述大屏展示指令进行显示,其中,所述大屏展示指令携带所述关键字。Sending module: when the answer to the question contains a keyword that is preset to trigger large-screen display, send the answer to the question to the robot, and send a large-screen display instruction to the screen display device to make the screen display The device displays according to the large-screen display instruction, where the large-screen display instruction carries the keyword.
  6. 根据权利要求5所述的装置,其特征在于,所述匹配模块包括:The device according to claim 5, wherein the matching module comprises:
    拆分单元:用于使用预设分词算法将所述问题文字拆分为多个词语;Splitting unit: used to split the question text into multiple words using a preset word segmentation algorithm;
    搜索单元:用于在预设问答库中搜索包含至少一个所述词语的语句;Search unit: used to search for sentences containing at least one of the words in the preset question and answer library;
    计算单元:用于计算所述语句和所述问题文字的相似度,得到相似度值;Calculation unit: used to calculate the similarity between the sentence and the question text to obtain the similarity value;
    确定单元:用于将所述相似度值最高的语句对应的答复作为所述问题文字的问题答案。The determining unit: used to use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
  7. 根据权利要求5所述的装置,其特征在于,所述发送模块包括:The device according to claim 5, wherein the sending module comprises:
    搜索单元:用于根据所述大屏展示指令在预设展示库中搜索包含所述关键字的影像文件;Searching unit: used to search for an image file containing the keyword in a preset display library according to the large-screen display instruction;
    显示单元:用于显示搜索到所述影像文件。Display unit: used to display the searched image file.
  8. 根据权利要求5所述的装置,其特征在于,当所述机器人与所述屏幕展示设备的数量均为多个时,所述发送模块还包括:The apparatus according to claim 5, wherein when the number of the robot and the screen display device are both multiple, the sending module further comprises:
    获取单元:用于获取发送所述问题语音的机器人身份标号;Acquisition unit: used to acquire the identification number of the robot that sent the question voice;
    发送单元:用于向所述屏幕展示设备发送大屏展示指令,其中,所述屏幕展示设备与所述机器人身份标号相关联。Sending unit: used to send a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.
  9. 一种计算设备,包括:处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;A computing device includes: a processor, a memory, a communication interface, and a communication bus. The processor, the memory, and the communication interface communicate with each other through the communication bus;
    所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行如权利要求1-4中任一项所述的一种人机交互方法对应的操作。The memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform an operation corresponding to a human-computer interaction method according to any one of claims 1-4.
  10. 一种计算机可读存储介质,所述存储介质中存储有至少一可执行指令,所述可执行指令使处理器执行如权利要求1-4中任一项所述的一种人机交互方法对应的操作。A computer-readable storage medium, wherein at least one executable instruction is stored in the storage medium, and the executable instruction causes a processor to execute the corresponding human-computer interaction method according to any one of claims 1-4 Operation.
PCT/CN2019/116091 2019-01-14 2019-11-06 Human-computer interaction method and apparatus, computing device, and computer-readable storage medium WO2020147380A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910030580.1A CN109947911B (en) 2019-01-14 2019-01-14 Man-machine interaction method and device, computing equipment and computer storage medium
CN201910030580.1 2019-01-14

Publications (1)

Publication Number Publication Date
WO2020147380A1 true WO2020147380A1 (en) 2020-07-23

Family

ID=67007300

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/116091 WO2020147380A1 (en) 2019-01-14 2019-11-06 Human-computer interaction method and apparatus, computing device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN109947911B (en)
WO (1) WO2020147380A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112735413A (en) * 2020-12-25 2021-04-30 浙江大华技术股份有限公司 Instruction analysis method based on camera device, electronic equipment and storage medium
CN112820314A (en) * 2021-01-11 2021-05-18 润联软件系统(深圳)有限公司 Intelligent voice control large screen display method, system and related components thereof
CN113362827A (en) * 2021-06-24 2021-09-07 未鲲(上海)科技服务有限公司 Speech recognition method, speech recognition device, computer equipment and storage medium
US20220206671A1 (en) * 2020-12-25 2022-06-30 Toyota Jidosha Kabushiki Kaisha Agent display method, non-transitory computer readable medium, and agent display system
CN115130022A (en) * 2022-07-04 2022-09-30 北京字跳网络技术有限公司 Content search method, device, equipment and medium

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109947911B (en) * 2019-01-14 2023-06-16 达闼机器人股份有限公司 Man-machine interaction method and device, computing equipment and computer storage medium
CN110706704A (en) * 2019-10-17 2020-01-17 四川长虹电器股份有限公司 Method, device and computer equipment for generating voice interaction prototype

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104461525A (en) * 2014-11-27 2015-03-25 韩慧健 Intelligent user-defined consulting platform generating system
CN106847278A (en) * 2012-12-31 2017-06-13 威盛电子股份有限公司 System of selection and its mobile terminal apparatus and information system based on speech recognition
CN108628939A (en) * 2017-03-22 2018-10-09 卡西欧计算机株式会社 The storage medium that information display device, method for information display and computer capacity are read
CN109947911A (en) * 2019-01-14 2019-06-28 深圳前海达闼云端智能科技有限公司 A kind of man-machine interaction method, calculates equipment and computer storage medium at device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101145289A (en) * 2007-09-13 2008-03-19 上海交通大学 Remote teaching environment voice answering system based on proxy technology
CN107315766A (en) * 2017-05-16 2017-11-03 广东电网有限责任公司江门供电局 A kind of voice response method and its device for gathering intelligence and artificial question and answer
CN108010531B (en) * 2017-12-14 2021-07-27 南京美桥信息科技有限公司 Visual intelligent inquiry method and system
CN108415995B (en) * 2018-02-13 2022-04-22 北京百度网讯科技有限公司 Searching method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106847278A (en) * 2012-12-31 2017-06-13 威盛电子股份有限公司 System of selection and its mobile terminal apparatus and information system based on speech recognition
CN104461525A (en) * 2014-11-27 2015-03-25 韩慧健 Intelligent user-defined consulting platform generating system
CN108628939A (en) * 2017-03-22 2018-10-09 卡西欧计算机株式会社 The storage medium that information display device, method for information display and computer capacity are read
CN109947911A (en) * 2019-01-14 2019-06-28 深圳前海达闼云端智能科技有限公司 A kind of man-machine interaction method, calculates equipment and computer storage medium at device

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112735413A (en) * 2020-12-25 2021-04-30 浙江大华技术股份有限公司 Instruction analysis method based on camera device, electronic equipment and storage medium
US20220206671A1 (en) * 2020-12-25 2022-06-30 Toyota Jidosha Kabushiki Kaisha Agent display method, non-transitory computer readable medium, and agent display system
CN112820314A (en) * 2021-01-11 2021-05-18 润联软件系统(深圳)有限公司 Intelligent voice control large screen display method, system and related components thereof
CN113362827A (en) * 2021-06-24 2021-09-07 未鲲(上海)科技服务有限公司 Speech recognition method, speech recognition device, computer equipment and storage medium
CN113362827B (en) * 2021-06-24 2024-02-13 上海风和雨网络科技有限公司 Speech recognition method, device, computer equipment and storage medium
CN115130022A (en) * 2022-07-04 2022-09-30 北京字跳网络技术有限公司 Content search method, device, equipment and medium

Also Published As

Publication number Publication date
CN109947911B (en) 2023-06-16
CN109947911A (en) 2019-06-28

Similar Documents

Publication Publication Date Title
WO2020147380A1 (en) Human-computer interaction method and apparatus, computing device, and computer-readable storage medium
US10997258B2 (en) Bot networks
US20200402515A1 (en) Dialog management with multiple modalities
CN112291203B (en) Locally saving data for voice actions with selective offline capability
US10224035B1 (en) Voice search assistant
WO2019100738A1 (en) Multi-participant human-machine interaction method and device
CN114207710A (en) Detecting and/or registering a thermal command to trigger a response action by an automated assistant
US11922945B2 (en) Voice to text conversion based on third-party agent content
JP6730994B2 (en) Question/answer information processing method, device, storage medium, and device
US20190018694A1 (en) Virtual laboratory assistant platform
JP2020521210A (en) Information processing method and terminal, computer storage medium
WO2021175007A1 (en) Online customer service consultation method and apparatus, medium, and electronic device
US20190065498A1 (en) System and method for rich conversation in artificial intelligence
WO2020253064A1 (en) Speech recognition method and apparatus, and computer device and storage medium
WO2021063089A1 (en) Rule matching method, rule matching apparatus, storage medium and electronic device
KR20200124298A (en) Mitigate client device latency when rendering remotely generated automated assistant content
CN114064943A (en) Conference management method, conference management device, storage medium and electronic equipment
WO2023272616A1 (en) Text understanding method and system, terminal device, and storage medium
JP7288885B2 (en) Voice interaction method, device, equipment and storage medium
WO2023040545A1 (en) Data processing method and apparatus, device, storage medium, and program product
WO2021170094A1 (en) Method and device for information interaction
US20190221295A1 (en) Remote Medical Analysis Systems And Methods
US20210264910A1 (en) User-driven content generation for virtual assistant
US20230153061A1 (en) Hierarchical Context Specific Actions from Ambient Speech
US11853975B1 (en) Contextual parsing of meeting information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19910261

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19910261

Country of ref document: EP

Kind code of ref document: A1