WO2020147380A1

WO2020147380A1 - Human-computer interaction method and apparatus, computing device, and computer-readable storage medium

Info

Publication number: WO2020147380A1
Application number: PCT/CN2019/116091
Authority: WO
Inventors: 王树良; 马世奎; 孙文豹
Original assignee: 深圳前海达闼云端智能科技有限公司
Priority date: 2019-01-14
Filing date: 2019-11-06
Publication date: 2020-07-23
Also published as: CN109947911B; CN109947911A

Abstract

The present application relates to the technical field of human-computer interaction, in particular, to a human-computer interaction method and apparatus, a computing device, and a computer-readable storage medium. The method comprises: obtaining a question speech of a user sent by a robot; converting the question speech into question text; matching a question answer of the question text in a preset question and answer bank; determining whether the question answer comprises a preset keyword for triggering large-screen display; and if yes, sending the question answer to the robot, and sending a large-screen display instruction to a screen display device, so that the screen display device performs display according to the large-screen display instruction, wherein the large-screen display instruction carries the keyword. Thus, the use of the solution of the present application can make a robot communicate with a person by means of screen display.

Description

Human-computer interaction method, device, computing equipment and computer readable storage medium

Technical field

The embodiments of the present application relate to the field of human-computer interaction technology, and in particular, to a human-computer interaction method, device, computing device, and computer-readable storage medium.

Background technique

Human-computer interaction is a process of information exchange between people and computers using a certain language to complete certain tasks in a certain interactive manner. With the development of intelligent technology, breakthroughs have been made in the research of intelligent robots, which have been widely used in technical fields such as family life, medical treatment, and industry, and the interaction between humans and robots has become more and more diversified, such as, Text interaction, voice interaction, etc., among them, voice interaction is currently a main way of interaction between humans and robots.

In the process of realizing this application, the inventor of the present application found that: existing robots communicate with people through text or voice, and cannot display images related to the conversation content through the screen.

Application content

In view of the above problems, this application is proposed to provide a human-computer interaction method, device, computing device, and computer-readable storage medium that overcome the above problems or at least partially solve the above problems.

In order to solve the above technical problems, a technical solution adopted in the embodiments of this application is to provide a human-computer interaction method, including obtaining the user's question voice sent by the robot; converting the question voice into question text; The answer to the question that matches the text of the question in the question; judge whether the answer to the question contains a keyword that is preset to trigger a large-screen display; if it does, send the answer to the question to the robot and send a large-screen display to the screen display device Instructions to cause the screen display device to display according to the large-screen display instruction, wherein the large-screen display instruction carries the keyword.

Optionally, the question answer matching the question text in the preset question and answer library includes: using a preset word segmentation algorithm to split the question text into multiple words; searching in the preset question and answer library containing at least one question Calculate the similarity between the sentence and the question text to obtain the similarity value; use the answer corresponding to the sentence with the highest similarity value as the question answer of the question text.

Optionally, the screen display device performs display according to the large-screen display instruction, wherein the carrying of the keyword in the large-screen display instruction includes: searching a preset display library according to the large-screen display instruction to include all The image file of the keyword; display the image file searched.

Optionally, when the number of the screen display devices of the robot is multiple, the sending a large-screen display instruction to the screen display device may further include: obtaining the identification number of the robot that sends the question voice; The screen display device sends a large-screen display instruction, wherein the screen display device is associated with the robot identity label.

Another technical solution adopted in the embodiments of the present application is to provide a human-computer interaction device, including: an acquisition module: used to acquire the user’s question voice sent by the robot; and a conversion module: used to convert the question voice into question text ; Matching module: used to match the answer to the question text in the preset question and answer library; Judging module: used to judge whether the answer to the question contains a preset keyword that triggers large-screen display; Sending module: Used for When the answer to the question contains a keyword that is preset to trigger a large-screen display, the answer to the question is sent to the robot, and a large-screen display instruction is sent to the screen display device, so that the screen display device responds to the large-screen display The display instruction is displayed, wherein the large-screen display instruction carries the keyword.

Optionally, the matching module includes: a splitting unit: used to split the question text into multiple words using a preset word segmentation algorithm; a search unit: used to search a preset question and answer library that contains at least one of the The sentence of the word; calculation unit: used to calculate the similarity between the sentence and the question text to obtain the similarity value; the determination unit: used to take the answer corresponding to the sentence with the highest similarity value as the question text Answer.

Optionally, the sending module includes: a search unit: used to search for an image file containing the keyword in a preset display library according to the large-screen display instruction; a display unit: used to display the searched image file .

Optionally, when the numbers of the robot and the screen display device are both multiple, the sending module further includes:

Obtaining unit: used to obtain the identity label of the robot that sent the question voice; sending unit: used to send a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.

Another technical solution adopted in the embodiments of the present application is to provide a computing device, including: a processor, a memory, a communication interface, and a communication bus. The processor, the memory, and the communication interface are completed through the communication bus. Mutual communication; the memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to a human-computer interaction method.

Another technical solution adopted in the embodiments of the present application is to provide a computer-readable storage medium in which at least one executable instruction is stored, and the executable instruction causes a processor to execute a corresponding human-computer interaction method. Operation.

The beneficial effect of the embodiment of the present application is: different from the prior art, the embodiment of the present application presets keywords that trigger the large-screen display in the preset question and answer library. When the robot is in the process of voice dialogue with the human, the answer to the question is When the keyword is included, the screen is triggered to display the content related to the keyword, so that in addition to the voice interaction between the robot and the human, the related content can be vividly displayed through the video image; in addition, when there are multiple robots And large-screen display equipment, by associating the large-screen display device with the robot’s identity label, it is possible to display the video and audio files corresponding to the screen display keywords contained in the question voice sent by the specific robot on the specific large-screen display device , To ensure the security of the content displayed on the screen.

The above description is only an overview of the technical solutions of this application. In order to understand the technical means of this application more clearly, it can be implemented in accordance with the content of the specification, and in order to make the above and other objectives, features and advantages of this application more obvious and understandable. In the following, specific examples of the application are cited.

BRIEF DESCRIPTION

By reading the detailed description of the preferred embodiments below, various other advantages and benefits will become clear to those of ordinary skill in the art. The drawings are only used for the purpose of illustrating preferred embodiments, and are not considered as a limitation to the application. Furthermore, throughout the drawings, the same reference symbols are used to denote the same components. In the drawings:

FIG. 1 is a flowchart of a human-computer interaction method according to an embodiment of the present application;

2 is a flowchart of a question answer matching question text in a human-computer interaction method according to an embodiment of the present application;

3 is a flowchart of sending a large-screen display instruction to a screen display device in a human-computer interaction method according to another embodiment of the present application;

4 is a functional block diagram of a human-computer interaction device according to an embodiment of the present application;

Fig. 5 is a schematic diagram of a computing device according to an embodiment of the present application.

detailed description

Hereinafter, exemplary embodiments of the present disclosure will be described in more detail with reference to the accompanying drawings. Although the exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. On the contrary, these embodiments are provided to enable a more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art. Fig. 1 is a flowchart of an embodiment of a human-computer interaction method according to this application. As shown in Figure 1, the method includes the following steps:

Step S101: Obtain the user's question voice sent by the robot.

In this step, the robot user has a dialogue with the robot, and the dialogue mode can be a mobile terminal APP or a robot body. A robot control unit is arranged inside the robot body, and the robot control unit can receive question voices sent by a user. The robot described here includes a robot body and a robot control unit.

Step S102: Convert the question voice into question text.

In this step, when the robot control unit receives the question voice, it uses a preset voice transmission channel to transmit it to the robot management platform. The preset voice transmission channel is a preset channel dedicated to voice transmission, such as , Zeroc ice channel. After receiving the question voice, the robot management platform uses a preset voice conversion algorithm to convert the question voice into question text. The preset voice conversion algorithm is an existing technology, such as the voice developed by iFlytek Conversion algorithm.

Step S103: Match the question answer of the question text in the preset question answering library.

In this step, the question text is matched with the preset question and answer library to obtain the answer to the question. As shown in Figure 2, the question answer matching the question text in the preset question answering library includes the following steps:

Step S1031: Use a preset word segmentation algorithm to split the question text into multiple words.

In this step, the preset word segmentation algorithm is the prior art. When the question sentence is split, the sentence is split into a combination of several words according to the composition of the word in the sentence, such as: The sentence is "what to eat today?". When splitting, the question sentence will be split into a combination of "today", "eat" and "what".

Step S1032: Search for sentences containing at least one of the words in the preset question and answer library.

In this step, the format of the sentences stored in the preset question and answer library is a question-and-answer form, that is, there is an answer to the corresponding question after each question. The preset question and answer library is searched, and the words and The question in the preset question and answer library is searched for matching, for example, "What’s eating today?" The split words are a combination of the three words "today", "eat" and "what". The three words are used as the search content to be matched with the questions in the preset question and answer library respectively.

Step S1033: Calculate the similarity between the sentence and the question text to obtain a similarity value.

In this step, after the word is matched in the preset question and answer library, the sentence in which the word is located is calculated for the similarity with the question text. The calculation algorithm is the prior art and is not limited here. For example, if the question text is "What do you eat today?", after matching with the question in the preset question and answer library, one of the matched sentences is "What do you eat for lunch", then the word matching degree is used as the similarity calculation In the algorithm, there are two overlapping words between the question text and the matched sentence, namely "eat" and "what". The total word segmentation in the sentence is three words, so the similarity value is 67%.

Step S1034: Use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.

In this step, calculate the similarity between each sentence retrieved in the preset question answering database and the question text, and consider that the sentence with the highest similarity is the closest to the question text, which is understandable After the user sends a question sentence to the robot, he hopes to get the answer corresponding to the input question sentence. Therefore, the answer corresponding to the sentence with the highest similarity in the preset question and answer library is sent as the question text Answer.

Step S104: Determine whether the answer to the question contains a keyword that is preset to trigger a large-screen display, if it contains, execute step S105, if not, execute step S106.

In this step, the preset question and answer library is preset with keywords that trigger large-screen display, and the keywords displayed on the large-screen are associated with image names in the image library displayed on the large-screen.

Step S105: Send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries The keywords.

In this step, when the answer to the question contains a keyword displayed on a preset large screen, while sending the answer to the question to the robot, a large screen display instruction is sent to the screen display device, and the large screen The display instruction carries the keywords displayed on the large screen, and the keywords displayed on the large screen are keywords preset in the preset question and answer database, for example, when the answer to the question is retrieved containing "iphone8" When one word is used, a large-screen display instruction is generated, and the large-screen display instruction is sent to the screen display device.

It should be noted that, after the screen display device receives the large-screen display instruction, it searches for the image file containing the keyword in a preset display library according to the large-screen display instruction, and displays the searched Image file. Specifically, according to the keyword, the screen display device is called to process an image file in a preset large-screen display image library, the image file is named after the keyword, or is related to the keyword rule When the image file is retrieved, the image file is played on the screen display device.

It should be understood that when using the keyword to retrieve the image file, there may be more than one image file related to the keyword. In this case, calculate the similarity between the keyword and the file name of the image file, and The image file corresponding to the image file name with the highest similarity is played, and the specific method for calculating the similarity can refer to the operation described when matching the question answer of the question text in the preset question and answer library, which will not be repeated here.

In one embodiment, the screen display device and the robot management platform are bridged through a back-end server. When the answer to the question contains the preset keyword that triggers large-screen display, the robot management platform will The keyword is sent to the back-end server. When the back-end server receives the keyword, it generates a large-screen display instruction, and sends the large-screen display instruction to the screen display device through a preset protocol. The preset The protocol corresponds to the background server, for example, the background server is a message queue telemetry transmission (MQTT) server, and the preset protocol is a message queue telemetry transmission protocol.

Step S106: Send the answer to the question to the robot.

In this step, when the answer to the question does not include the preset keyword that triggers the large-screen display, it means that the answer to the question does not need to be displayed on the screen display device. At this time, the answer to the question is sent to robot.

It should be understood that when the robot receives the answer to the question, it will broadcast the answer to the user in voice. In this process, the process of converting text into speech is involved. The conversion algorithm is the existing technology. No longer. In this process, the conversion algorithm can be preset in the robot control unit, and the converted voice can be directly sent to the robot body, or when the robot body receives the answer to the question, the robot control unit controls the robot body according to The preset conversion algorithm is converted into speech, and the specific conversion method is not limited here.

In the embodiment of the present application, by matching the answer of the question text in the preset question and answer library, and preset the keywords that trigger the large-screen display in the preset question and answer library, the voice communication between the robot and the user is realized. The video is interactive at the same time, so that the robot can show the dialogue content to the user in a more vivid and concrete way.

In some embodiments, multiple robots and screen display devices can be configured, and a corresponding screen display device is assigned to each robot in advance. When a robot needs a screen display device to assist in the display, the corresponding screen display device is controlled to display As shown in Figure 3, the step S105 sending a large-screen display instruction to the screen display device includes the following steps:

Step S301: Obtain the identification number of the robot sending the question voice.

In this step, the robot is preset with an identity label, and when the robot receives the question voice sent by the user, the question voice is sent to the robot management platform by carrying the robot identity label.

Step S302: Send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.

In this step, the screen display device is pre-associated with the robot identity label, and when the answer to the question contains the keyword that triggers the large-screen display, the robot management platform will mark the identity label with the robot. The associated screen display device sends a large-screen display instruction, so that the screen display device associated with the robot identity performs display according to the large-screen display instruction. For example, there are two robots and three screen display devices, the identification numbers of the two robots are 1 and 2, respectively, the three screen display devices are marked as A, B, C, and the large-screen display devices A and B Subscribing to the question voice sent by robot 1, and large-screen display device C subscribes to the question voice sent by robot 2, and when the question voice received by robot 1 contains keywords that trigger the large-screen display, the robot management platform will carry The large-screen display instruction of the keyword is sent to the screen display devices A and B, but not to C.

It should be understood that: the large-screen display device can set a display library according to the use of its associated robot. For example, in a hospital, the robot 1 is used for diagnosis, and the preset display library can be set to be related to the location of each department of the hospital. Video files.

In this embodiment, by associating the screen display device with the robot identity label, it is realized that the video and audio corresponding to the large-screen display keywords contained in the question voice sent by the specific robot are played on the specific large-screen display device, ensuring that The security of the content displayed on the screen and the efficiency of the screen display are improved.

Fig. 4 is a functional block diagram of a human-computer interaction device of the present application. As shown in Fig. 4, the device includes: an acquisition module 401, a conversion module 402, a matching module 403, a judgment module 404, and a sending module 405, wherein the The obtaining module 401 is used to obtain the user's question voice sent by the robot; the conversion module 402 is used to convert the question voice into question text; the matching module 403 is used to match the question text of the question text in a preset question and answer library Answer; judging module 404, for judging whether the answer to the question contains a keyword preset to trigger large-screen display; sending module 405, for when the answer to the question contains a keyword preset to trigger large-screen display, Send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries the key word.

Wherein, the matching module 403 includes: a splitting unit 4031, a searching unit 4032, a calculation unit 4033, and a determining unit 4034. The splitting unit 4031 is used to split the question text into multiple words using a preset word segmentation algorithm The searching unit 4032 is used to search for sentences containing at least one of the words in the preset question and answer library; the calculating unit 4033 is used to calculate the similarity between the sentences and the question text to obtain the similarity value; the determining unit 4034 , Used to use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.

Wherein, the sending module 405 includes a search unit 4051 and a display unit 4052. The search unit 4051 is configured to search for an image file containing the keyword in a preset display library according to the large-screen display instruction; The display unit 4052 is configured to display the searched image file.

Wherein, when the number of the robot and the screen display device are both multiple, the sending module further includes: an acquiring unit 4053 and a sending unit 4054, wherein the acquiring unit 4053 is configured to acquire and send the question The voice robot identity label; the sending unit 4054 is configured to send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.

In the embodiment of the present application, the judgment module is used to judge whether the preset Q&A library contains a preset keyword that triggers large-screen display, and when the keyword is contained, the large-screen display instruction is sent to the screen display device through the sending module, so that The screen display device displays the content related to the keyword, so that the robot can display the video image vividly through the screen display device during the dialogue with the human; in addition, when there are multiple robots and large-screen display devices At the time, the robot’s identity label is obtained through the acquisition unit, and the large-screen display instruction is sent to the large-screen display device associated with the robot’s identity label through the sending module, thereby realizing the screen display contained in the question voice sent by the specific robot The video and audio files corresponding to the keywords are displayed on a specific large-screen display device to ensure the security of the content displayed on the screen.

The embodiments of the present application provide a non-volatile computer-readable storage medium, the computer-readable storage medium stores at least one executable instruction, and the computer-executable instruction can execute one of the above-mentioned method embodiments. Machine interaction method.

FIG. 5 is a schematic structural diagram of an embodiment of a computing device of this application, and the specific embodiment of this application does not limit the specific implementation of the computing device.

As shown in FIG. 5, the computing device may include: a processor (processor) 502, a communication interface (Communications Interface) 504, a memory (memory) 506, and a communication bus 508.

among them:

The processor 502, the communication interface 504, and the memory 506 communicate with each other through the communication bus 508.

The communication interface 504 is used to communicate with other devices.

The processor 502 is configured to execute the program 510, and specifically can execute the relevant steps in the foregoing embodiment of the human-computer interaction method.

Specifically, the program 510 may include program code, and the program code includes computer operation instructions.

The processor 502 may be a central processing unit CPU, or an ASIC (Application Specific Integrated Circuit), or one or more integrated circuits configured to implement the embodiments of the present application. The one or more processors included in the computing device may be the same type of processor, such as one or more CPUs, or different types of processors, such as one or more CPUs and one or more ASICs.

The memory 506 is used to store the program 510. The memory 506 may include a high-speed RAM memory, and may also include a non-volatile memory (non-volatile memory), for example, at least one disk memory.

The program 510 may be specifically used to cause the processor 502 to perform the following operations:

Obtain the user’s question voice sent by the robot; convert the question voice into question text; match the question answer of the question text in the preset question and answer library; determine whether the question answer contains the key to trigger the large-screen display by default If it contains, send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display The instruction carries the keyword.

In an optional manner, the program 510 may be further specifically used to cause the processor 502 to perform the following operations:

Use a preset word segmentation algorithm to split the question text into multiple words; search a preset question and answer library for sentences containing at least one of the words; calculate the similarity between the sentence and the question text to obtain a similarity value ; Use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.

In an optional manner, the program 510 may be further configured to cause the processor 502 to perform the following operations: according to the large-screen display instruction, search for an image file containing the keyword in a preset display library; The image file.

In an optional manner, when the numbers of the robot and the screen display device are both multiple, the program 510 may be further specifically configured to cause the processor 502 to perform the following operations: Obtain the robot that sent the question voice Identity label; sending a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.

The algorithms and displays provided here are not inherently related to any particular computer, virtual system or other equipment. Various general-purpose systems can also be used with the teaching based on this. From the above description, the structure required to construct this type of system is obvious. In addition, this application is not aimed at any specific programming language. It should be understood that various programming languages can be used to implement the content of the application described herein, and the above description of a specific language is for disclosing the best embodiment of the application.

The specification provided here explains a lot of specific details. However, it can be understood that the embodiments of the present application can be practiced without these specific details. In some instances, well-known methods, structures, and techniques have not been shown in detail so as not to obscure the understanding of this description.

Similarly, it should be understood that in order to simplify the present disclosure and help understand one or more of the various inventive aspects, in the above description of the exemplary embodiments of the present application, the various features of the present application are sometimes grouped together into a single embodiment, Figure, or its description. However, the disclosed method should not be construed to reflect the intention that the claimed application requires more features than the features explicitly recorded in each claim. More precisely, as reflected in the claims, the inventive aspect lies in less than all the features of a single embodiment disclosed previously. Therefore, the claims following the specific embodiment are thus explicitly incorporated into the specific embodiment, wherein each claim itself serves as a separate embodiment of the application.

Those skilled in the art can understand that it is possible to adaptively change the modules in the device in the embodiment and set them in one or more devices different from the embodiment. The modules or units or components in the embodiments may be combined into one module or unit or component, and in addition, they may be divided into a plurality of submodules or subunits or subcomponents. Except that at least some of such features and/or processes or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract and drawings) and any method so disclosed may be adopted in any combination All processes or units of equipment are combined. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose.

In addition, those skilled in the art can understand that although some embodiments described herein include certain features included in other embodiments but not other features, the combination of features of different embodiments means that they are within the scope of the present application. Within and form different embodiments. For example, in the following claims, any one of the claimed embodiments can be used in any combination.

Each component embodiment of the present application may be implemented by hardware, or by software modules running on one or more processors, or by a combination of them. Those skilled in the art should understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components in the human-computer interaction device according to the embodiments of the present application. The application can also be implemented as a device or device program (for example, a computer program and a computer program product) for executing part or all of the methods described herein. Such a program for implementing the present application may be stored on a computer-readable medium, or may have the form of one or more signals. Such a signal can be downloaded from an Internet website, or provided on a carrier signal, or provided in any other form.

It should be noted that the above-mentioned embodiments illustrate rather than limit the application, and those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs between parentheses should not be constructed as limitations on the claims. The word "comprising" does not exclude the presence of elements or steps not listed in the claims. The word "a" or "an" preceding an element does not exclude the presence of multiple such elements. The application can be implemented by means of hardware including several different elements and by means of a suitably programmed computer. In the unit claims enumerating several devices, several of these devices may be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.

Claims

A human-computer interaction method, characterized in that it comprises:

Get the user's question voice sent by the robot;

Convert the question voice into question text;

The answer to the question matching the question text in the preset question and answer library;

Determine whether the answer to the question contains keywords that preset to trigger large-screen display;

If it does, send the answer to the question to the robot, and send a large-screen display instruction to the screen display device, so that the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carries The keywords.
The method according to claim 1, wherein the question answer matching the question text in the preset question answering library comprises:

Use a preset word segmentation algorithm to split the question text into multiple words;

Searching for sentences containing at least one of the words in the preset question and answer library;

Calculate the similarity between the sentence and the question text to obtain the similarity value;

The answer corresponding to the sentence with the highest similarity value is used as the answer to the question text.
The method according to claim 1, wherein the screen display device displays according to the large-screen display instruction, wherein the large-screen display instruction carrying the keyword includes:

Searching for an image file containing the keyword in a preset display library according to the large-screen display instruction;

Display the searched image file.
The method according to claim 1, wherein when the number of the robot and the screen display device are both multiple, the sending a large-screen display instruction to the screen display device further comprises:

Acquiring the identification number of the robot that sent the question voice;

Send a large-screen display instruction to the screen display device, where the screen display device is associated with the robot identity label.
A human-computer interaction device, characterized in that it comprises:

Acquisition module: used to acquire the user's question voice sent by the robot;

Conversion module: used to convert the question voice into question text;

Matching module: used to match the question answer of the question text in the preset question and answer library;

Judgment module: used to judge whether the answer to the question contains a keyword preset to trigger a large-screen display;

Sending module: when the answer to the question contains a keyword that is preset to trigger large-screen display, send the answer to the question to the robot, and send a large-screen display instruction to the screen display device to make the screen display The device displays according to the large-screen display instruction, where the large-screen display instruction carries the keyword.
The device according to claim 5, wherein the matching module comprises:

Splitting unit: used to split the question text into multiple words using a preset word segmentation algorithm;

Search unit: used to search for sentences containing at least one of the words in the preset question and answer library;

Calculation unit: used to calculate the similarity between the sentence and the question text to obtain the similarity value;

The determining unit: used to use the answer corresponding to the sentence with the highest similarity value as the answer to the question text.
The device according to claim 5, wherein the sending module comprises:

Searching unit: used to search for an image file containing the keyword in a preset display library according to the large-screen display instruction;

Display unit: used to display the searched image file.
The apparatus according to claim 5, wherein when the number of the robot and the screen display device are both multiple, the sending module further comprises:

Acquisition unit: used to acquire the identification number of the robot that sent the question voice;

Sending unit: used to send a large-screen display instruction to the screen display device, wherein the screen display device is associated with the robot identity label.
A computing device includes: a processor, a memory, a communication interface, and a communication bus. The processor, the memory, and the communication interface communicate with each other through the communication bus;

The memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform an operation corresponding to a human-computer interaction method according to any one of claims 1-4.
A computer-readable storage medium, wherein at least one executable instruction is stored in the storage medium, and the executable instruction causes a processor to execute the corresponding human-computer interaction method according to any one of claims 1-4 Operation.