WO2021060391A1 - Information providing method, information providing system, information providing device, and computer program - Google Patents

Information providing method, information providing system, information providing device, and computer program Download PDF

Info

Publication number
WO2021060391A1
WO2021060391A1 PCT/JP2020/036093 JP2020036093W WO2021060391A1 WO 2021060391 A1 WO2021060391 A1 WO 2021060391A1 JP 2020036093 W JP2020036093 W JP 2020036093W WO 2021060391 A1 WO2021060391 A1 WO 2021060391A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
response
request
user
voice
Prior art date
Application number
PCT/JP2020/036093
Other languages
French (fr)
Japanese (ja)
Inventor
戸田良樹
詩博 徐
Original Assignee
Tradfit株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tradfit株式会社 filed Critical Tradfit株式会社
Priority to JP2021507724A priority Critical patent/JP6920773B1/en
Publication of WO2021060391A1 publication Critical patent/WO2021060391A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/10Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue

Definitions

  • This disclosure relates to an information providing method, an information providing system, an information providing device, and a computer program that provide various information to a user who uses an accommodation facility.
  • Patent Document 1 proposes a communication system that establishes a dialogue in natural language between a user and a robot.
  • the robot has a microphone and a speaker, transmits the user's utterance acquired by the microphone to the server, receives a response from the server, and outputs the response by voice from the speaker.
  • the server has a conversation database of questions and answers used for conversation, and refers to this conversation database to generate a response to a user's utterance and send it to a robot.
  • the present disclosure has been made in view of such circumstances, and the purpose of the present disclosure is to provide information by appropriately using input / output of voice and text, an information providing method, an information providing system, and information providing. To provide equipment and computer programs.
  • the information providing method acquires a request based on the voice input to the user interface installed in the accommodation facility, acquires the response information corresponding to the acquired request from the database, and obtains the acquired response information from the user.
  • the interface is output by voice, and when the response information cannot be obtained from the database, the response is switched to an automatic response by inputting / outputting text.
  • a request based on voice input from the user is acquired by the user interface installed in the accommodation facility, and the response information to the acquired request is output by voice in the user interface. If it is not possible to output the response information for the acquired request, switch to automatic response by inputting / outputting text. This allows the user to easily switch from voice input / output to text input / output.
  • information can be provided to the user by appropriately using the input / output of voice and text.
  • the information providing system 10 can provide information to a user by using a smart speaker 20 installed as a user interface in an accommodation facility such as a hotel guest room, a common area, or a private house used for a private lodging.
  • the smart speaker 20 shown in FIG. 1 has a configuration in which a touch panel 25, a microphone 26, a speaker 27, a camera 28, and the like are appropriately arranged in a substantially hemispherical housing.
  • the shape of the smart speaker 20 is an example and is not limited to this, and various shapes such as a columnar shape or a rectangular parallelepiped shape can be adopted.
  • the user interface installed in the guest room is not limited to the smart speaker 20, and may be any device as long as it is an information processing device capable of input / output by voice.
  • a user (guest) staying in a guest room can activate the smart speaker 20 by speaking a specific keyword (for example, "start service").
  • the user can give various requests to the smart speaker 20 by voice input by utterance, and various information such as information about this accommodation facility can be obtained from the voice response information output from the smart speaker 20. ..
  • the user gives an utterance request to the smart speaker 20 "Tell me the check-out time".
  • requests various requests or questions that the user inputs to the smart speaker 20 or the like are referred to as requests.
  • the voice request input by the user to the smart speaker 20 is input to the voice processing server 50 via a network such as the Internet.
  • the voice processing server 50 converts the input voice information into text (character string) information by performing voice recognition processing on the voice input.
  • the voice processing server 50 inputs the converted text to the management server 30.
  • the management server 30 performs a process of generating response information to a user's request based on the text information input from the voice processing server 50.
  • the management server 30 uses the intention identification model 81 and the response information DB (database) 82.
  • the intention identification model 81 is a trained model that has been trained in advance so as to accept text information related to a request as input and output an identification result that identifies the user's intention related to this request.
  • the response information DB 82 is a database in which the intention related to the request and the text of the response message to the intention are stored in association with each other.
  • the management server 30 includes the intention identification model 81 and the response information DB 82, but the present invention is not limited to this, and the intention identification model 81 or the response information DB 82 may be provided by another server.
  • the management server 30 inputs the text information obtained from the voice processing server 50 into the intention identification model 81, and acquires the information output by the intention identification model 81 to acquire the intention of the user's request.
  • the intention identification model 81 identifies the user's intention to inquire about the checkout time with respect to the text information of "tell me the checkout time”.
  • the management server 30 acquires the text information of the response message for the intention of the request by referring to the response information DB 82 based on the intention of the identified user.
  • the management server 30 outputs the text information acquired from the response information DB 82 to the voice processing server 50.
  • the text information "Please check out by 10 am" is stored in the response information DB 82 as the response information.
  • the voice processing server 50 to which the text output of the response information is given from the management server 30 converts the given text information into voice information.
  • the voice processing server 50 outputs the converted voice information to the smart speaker 20.
  • the smart speaker 20 acquires the voice information output from the voice processing server 50, and outputs the voice based on the voice information to the speaker 27. In the example shown in FIG. 1, the smart speaker 20 outputs the voice "Please check out by 10 am".
  • the management server 30 can also respond to a request from the user to the smart speaker 20 other than voice output.
  • the management server 30 can display an image on the touch panel 25 of the smart speaker 20.
  • the management server 30 outputs image information for display to the voice processing server 50, and the voice processing server 50 gives this image information to the smart speaker 20.
  • the above-mentioned information provision to the user by voice input / output is performed when, for example, the language spoken by the user is not included in the language corresponding to the voice processing of the voice processing server 50, or when the content of the user's speech is, for example, the voice processing server 50.
  • the information providing system when it is not possible to properly convert to text information, it may not be possible to provide appropriate information to the user.
  • the information providing system according to the present embodiment when it is not possible to appropriately provide information by voice input / output, it switches to information provision by text input / output.
  • a system that automatically responds by inputting / outputting text is a so-called chatbot, and the information providing system according to the present embodiment provides information using the smart speaker 20 and information providing using the chatbot. It is a system that switches between and appropriately.
  • the management server 30 is notified, for example, that the voice processing server 50 cannot convert the voice into text, or when the intention identification model 81 cannot identify the intention of the user's request.
  • a message prompting the user to switch to the chatbot is output to the smart speaker 20.
  • the message can be, for example, "The voice request could not be recognized. Do you want to use a chatbot?" This message may be output by the smart speaker 20 as voice, or may be displayed as text on the touch panel 25.
  • the management server 30 provides access information for accessing the chatbot using a terminal device such as a smartphone 40. Output to the smart speaker 20.
  • the user's will to decide whether or not to use the chatbot may be accepted, for example, as a voice input, or may be accepted, for example, by a touch operation on the touch panel 25 of the smart speaker 20.
  • FIG. 3 is a schematic diagram showing an example of access information displayed on the smart speaker 20.
  • the smart speaker 20 displays the access information 20a on the touch panel 25.
  • the illustrated access information 20a is a two-dimensional code (for example, a barcode or a QR code (registered trademark)), and is a code of information such as a URL (Uniform Resource Locator) for accessing a chatbot, for example.
  • the access information 20a includes identification information for identifying the accommodation facility, the guest room, and the like.
  • the chatbot can acquire the identification information included in the access information 20a and perform an automatic response suitable for the accommodation facility, the guest room, and the like.
  • the access information 20a may have a configuration that includes only information such as a URL and does not include identification information such as accommodation facilities and guest rooms. In this case, for example, the user may input information about accommodation facilities, guest rooms, and the like.
  • the user can acquire information such as a URL for accessing the chatbot by photographing the access information 20a displayed on the smart speaker 20 with the camera of his / her smartphone 40.
  • the user who has acquired the access information 20a can start using the chatbot by accessing the server that provides the chatbot service using an application such as a browser installed on the smartphone 40.
  • the management server 30 provides the chatbot service.
  • a chat screen is displayed on the display unit of the smartphone 40, and the user can input the request as text on the chat screen by using the text input function of the smartphone 40.
  • FIG. 4 and 5 are schematic views showing an example of a chat screen displayed on the smartphone 40.
  • FIG. 4 shows a display example of a selection box that prompts the user to select a language prior to using the chatbot.
  • English, Japanese, Chinese, Korean, etc. can be used as the chat language, and the selectable languages are listed in the notation of the language in the selection box.
  • the user can select any one language by performing a touch (tap) operation on the language displayed in the selection box.
  • the language selection box shown in FIG. 4 is not only displayed prior to the start of chat, but also when the user touches the language selection icon provided on the lower left side of the chat screen shown in FIG. May also be displayed.
  • the language selection icon is labeled to indicate which language is currently selected. Further, on the chat screen of FIG. 5, a switching icon for switching to the manned response is displayed next to the language selection icon.
  • the user selects the language from the selection box shown in FIG. 4, but the present invention is not limited to this.
  • information such as the nationality of the user may be acquired from a server device that stores information about the guests of the accommodation facility, and a language suitable for the user may be automatically selected.
  • FIG. 5 shows an example of a request and response by chat.
  • English is selected as the chat language.
  • chat screen for example, title character strings such as "hotel information service”, “Example hotel", and “inquiry” are displayed at the top, and this chat screen is a chat screen for making inquiries to the hotel. It is shown.
  • the text of the request entered by the user and the text of the response to it are displayed in chronological order from top to bottom.
  • the text entered by the user is displayed on the right side of the chat screen, and the text of the response by the chatbot is displayed on the left side.
  • the text "How many waters would you like to order?" Is output as the response information to the text of the request "I want a water.”
  • the text information input on the smartphone 40 is input to the translation server 60 via a network such as the Internet.
  • the translation server 60 translates the input text information as needed.
  • the translation server 60 translates the text into a language supported by the management server 30.
  • the management server 30 can handle Japanese and English and a Chinese request is input
  • the translation server 60 translates the request from Chinese to Japanese or English and converts the translated text information.
  • the management server 30 may input the text information to the management server 30 without translating.
  • the management server 30 performs a process of generating response information to a user's request based on the text information input from the translation server 60.
  • the processing of the management server 30 performed at this time is the same as the processing of generating response information for the user's request by voice input, and is performed using the above-mentioned intention identification model 81 and response information DB 82.
  • the management server 30 outputs the response information of the generated text to the translation server 60.
  • the translation server 60 appropriately translates the response information given by the management server 30 as necessary, and outputs the response information to the smartphone 40.
  • the smartphone 40 automatically responds to the request input by the user by displaying the response information given as text from the translation server 60 as text on the chat screen.
  • the management server 30 includes an intention identification model 81 and a response information DB 82 for acquiring response information for a voice input request, and an intention identification model 81 and a response information DB 82 for acquiring response information for a text input request. It may be prepared separately.
  • Information is provided to the user by the chatbot, for example, when the request input by the user in text cannot identify the intention by the intention identification model 81, or when the response information corresponding to the response information DB 82 is not stored, for example. It may not be possible to provide appropriate information to the user.
  • the operator switches to the manned response.
  • there is an operator who responds to the request from the user instead of the management server 30 when the request from the user cannot be responded to by the automatic response.
  • the operator can use the operator terminal 70 to exchange textual information with the user's smartphone 40, that is, to chat.
  • the user can chat with the operator using the same chat screen as the chatbot's automatic response.
  • the management server 30 determines that the automatic response by text input / output cannot be performed, the management server 30 causes the smartphone 40 to output a message prompting the user to switch to the manned response.
  • the message can be, for example, "We cannot answer your request. If you want to switch to a manned response, please touch the switch icon below.”
  • the management server 30 notifies the operator terminal 70 to request the manned response. To switch from the chatbot's automatic response to the operator's manned response.
  • switching to a manned response by touching the switching icon provided on the chat screen may be accepted only when the management server 30 outputs a message prompting the switching to the smartphone 40, and whether or not this message is output may be accepted. Regardless, it may be accepted at all times.
  • the management server 30 accepts the switch only when the message prompting the switch is output to the smartphone 40, the switch icon does not have to be displayed on the chat screen when it is not needed.
  • the management server 30 transmits the text information input by the user on the smartphone 40 and translated by the translation server 60 to the operator terminal 70 as needed.
  • the operator terminal 70 displays a chat screen similar to that in FIG. 5 on the display unit based on the text information from the management server 30.
  • the operator inputs the information of the response to the user's request displayed on the operator terminal 70 to the operator terminal 70 by using an input device such as a keyboard.
  • the operator inputs text to the operator terminal 70, but the present invention is not limited to this, and a configuration such as voice input may be performed.
  • the operator terminal 70 accepts the text input of the response by the operator and transmits the received text information to the management server 30.
  • the management server 30 gives the text information from the operator terminal 70 to the translation server 60, and the text information translated by the translation server 60 is given to the user's smartphone 40 as needed.
  • the smartphone 40 displays the given text information on the chat screen. As a result, communication by the user and the operator via text is established, and the operator can respond to the user's request.
  • switching from information provision by voice input / output using the smart speaker 20 to information provision by text input / output using the smartphone 40 is performed via the access information displayed on the smart speaker 20.
  • the management server 30 outputs a switching instruction from voice input / output to text input / output to the device. Based on the output, the device may switch from voice input / output to text input / output.
  • FIG. 6 is a block diagram showing the configuration of the smart speaker 20 according to the present embodiment.
  • the smart speaker 20 includes a processing unit (processor) 21, a storage unit (storage) 22, a communication unit (transceiver) 23, a touch panel 25, a microphone 26, a speaker 27, a camera 28, and the like.
  • the processing unit 21 is configured by using an arithmetic processing unit such as a CPU (Central Processing Unit) or an MPU (Micro-Processing Unit). Further, the processing unit 21 may employ a plurality of CPUs, a multi-core CPU, or the like.
  • the processing unit 21 reads and executes the program 22a stored in the storage unit 22 to perform various processes such as reception of voice input, voice output, and image display.
  • the storage unit 22 is configured by using, for example, a non-volatile memory element such as a flash memory, a magnetic storage device such as a hard disk, or the like.
  • the storage unit 22 stores various programs executed by the processing unit 21 and various data required for processing by the processing unit 21.
  • the storage unit 22 stores the program 22a executed by the processing unit 21 and the identification information 22b for identifying the smart speaker 20.
  • the program 22a may be written in the storage unit 22 at the manufacturing stage of the smart speaker 20, for example.
  • the smart speaker 20 may acquire what is distributed by a remote server device or the like by communication.
  • the smart speaker 20 may read the program 22a recorded on the recording medium 99 such as a memory card or an optical disk and store it in the storage unit 22.
  • the writing device may read what has been recorded on the recording medium 99 and write it in the storage unit 22 of the smart speaker 20.
  • the program 22a may be provided in the form of distribution via the network, or may be provided in the form recorded on the recording medium 99.
  • the identification information 22b is information in which characters, numerical values, and the like are appropriately combined, and may be any information as long as it can identify the smart speaker 20.
  • the identification information 22b is information in which information for identifying the accommodation facility in which the smart speaker 20 is installed and information for identifying the guest room are combined.
  • the identification information 22b can be information indicating room 301 of the Example hotel.
  • the identification information 22b is set by the system administrator, the accommodation facility administrator, or the like when the smart speaker 20 is installed in the guest room of the accommodation facility.
  • the communication unit 23 can communicate with various devices via a network N including the Internet, a wireless LAN (Local Area Network), a mobile phone communication network, and the like.
  • the communication unit 23 communicates with the voice processing server 50 via the network N.
  • the communication unit 23 transmits the data given by the processing unit 21 to the voice processing server 50, and gives the data received from the voice processing server 50 to the processing unit 21.
  • the touch panel 25 is one of the user interfaces included in the smart speaker 20, and has a display unit 25a and an input unit 25b.
  • the display unit 25a is configured by using a liquid crystal display or the like, and displays various images, characters, and the like.
  • the display unit 25a displays various images based on the processing of the processing unit 21.
  • the input unit 25b is provided with a sensor for detecting contact by the user on the surface of the display unit 25a, and receives and accepts a touch (tap) operation or the like for an image or the like displayed on the display unit 25a as an input. Is notified to the processing unit 21.
  • the microphone 26 and the speaker 27 are one of the user interfaces included in the smart speaker 20, and realize a user interface by voice input / output.
  • the microphone 26 acquires peripheral voice, converts it into digital voice information, and gives the voice information to the processing unit 21.
  • the speaker 27 outputs audio based on the audio information given by the processing unit 21.
  • the camera 28 takes an image of, for example, a user or a guest room, and gives the captured image information to the processing unit 21.
  • the smart speaker 20 acquires a request uttered by the user at the speaker 27, and transmits the voice information related to the acquired request to the voice processing server 50 at the communication unit 23. At this time, the smart speaker 20 attaches the identification information 22b stored in the storage unit 22 and transmits the voice information. This voice information is converted into text information by the voice processing server 50 and given to the management server 30, and the management server 30 transmits a response to the request as text information to the voice processing server 50.
  • the voice processing server 50 converts the text information from the management server 30 into voice information and transmits it to the smart speaker 20.
  • the smart speaker 20 outputs the voice information given by the voice processing server 50 from the speaker 27.
  • the management server 30 If the management server 30 cannot respond to the user's request based on the text information given by the voice processing server 50, the management server 30 notifies that fact and transmits access information for accessing the chatbot. This notification and access information is given to the smart speaker 20 via the voice processing server 50. In response to this notification, the smart speaker 20 asks the user whether or not to switch to the chatbot, and when the switch to the chatbot is affirmed, as shown in FIG. 3, for accessing the two-dimensional code. The information is displayed on the display unit 25a of the touch panel 25.
  • FIG. 7 is a block diagram showing the configuration of the management server 30 according to the present embodiment.
  • the management server 30 includes a processing unit (processor) 31, a storage unit (storage) 32, a communication unit (transceiver) 33, and the like.
  • the processing unit 31 is configured by using an arithmetic processing unit such as a CPU, MPU, or GPU (Graphics Processing Unit).
  • the processing unit 31 reads and executes the server program 32a stored in the storage unit 32 to respond to a user's request, switch from voice input / output to a chatbot, and change from a chatbot to a manned response. Performs various processes such as switching process.
  • the storage unit 32 is configured by using a large-capacity storage device such as a hard disk.
  • the storage unit 32 stores various programs executed by the processing unit 31 and various data required for processing by the processing unit 31.
  • the storage unit 32 stores the server program 32a executed by the processing unit 31 and the intention identification model 81 as a learned model (discriminative device).
  • the storage unit 32 is provided with two databases, a response information DB 82 in which a response to a user's request is stored, and an operator DB 83 in which information about an operator who performs a manned response is stored.
  • the communication unit 33 can communicate with various devices via the network N including the Internet, wireless LAN, mobile phone communication network, and the like.
  • the communication unit 33 communicates with the voice processing server 50, the translation server 60, the operator terminal 70, and the like via the network N.
  • the communication unit 33 transmits the data given by the processing unit 31 to another device, and gives the data received from the other device to the processing unit 31.
  • the storage unit 32 may be an external storage device connected to the management server 30.
  • the management server 30 may be a multi-computer including a plurality of computers, or may be a virtual machine virtually constructed by software. Further, the management server 30 is not limited to the above configuration, and may include, for example, a reading unit that reads information stored in a portable storage medium, an input unit that accepts operation input, a display unit that displays an image, and the like. ..
  • the server program 32a may be written to the storage unit 32, for example, at the manufacturing stage of the management server 30.
  • the management server 30 may acquire what is distributed by another remote server device or the like by communication.
  • the management server 30 may read the server program 32a recorded on the recording medium 98 such as a memory card or an optical disk and store it in the storage unit 32.
  • the writing device may read what was recorded on the recording medium 98 and write it in the storage unit 32 of the management server 30.
  • the server program 32a may be provided in a mode of distribution via a network, or may be provided in a mode recorded on a recording medium 98.
  • the intention identification model 81 is a trained model in which machine learning or deep learning using teacher data has been performed in advance.
  • the trained model performs a predetermined operation on the input value and outputs the operation result, and the storage unit 32 stores data such as the coefficient and the threshold of the function that defines this operation as the intention identification model 81. Will be done.
  • the intention identification model 81 is a learned model learned to identify the user's intention with respect to the text information input by the user as voice or text.
  • the processing unit 31 that executes the server program 32a reads the data stored as the intention identification model 81, so that the processing unit 31 can execute an operation for identifying the intention of the user's request.
  • FIG. 8 is a schematic diagram showing a configuration example of the intention identification model 81.
  • the intention identification model 81 includes an input layer that accepts input of text information related to a user's request, an intermediate layer that performs a predetermined operation on the input information, and an identification result of the intention of the user's request. It is configured as a neural network having an output layer that outputs information indicating the above.
  • the intention identification model 81 may adopt, for example, a configuration of RNN (Recurrent Neural Network) as the neural network, but the present invention is not limited to this, and a model other than RNN may be adopted.
  • the management server 30 generates the intention identification model 81 by learning the RNN model using the text information related to the request and the teacher data associated with the intention.
  • the management server 30 is provided with the intention identification model 81, but the present invention is not limited to this.
  • the intention identification model 81 is provided in another server device, the text information related to the user's request acquired by the management server 30 is transmitted to the other server device, and the identification result by the intention identification model 81 possessed by the other server device is managed.
  • the server 30 may acquire it.
  • the management server 30 performs the learning process of the intention identification model 81.
  • the learning process may be performed by another server device.
  • the intention identification model 81 learned in this case may be transmitted from another server device to the management server 30 and stored in the storage unit 32, or may be held by another server device.
  • the trained intention identification model 81 may be provided in the form of distribution via the network, or may be provided in the form recorded on the recording medium 98, similarly to the server program 32a.
  • FIG. 9 is a schematic diagram showing a configuration example of the response information DB 82.
  • the response information DB 82 is a database in which the intention of the user's request and the response information such as a response message for this request are stored in association with each other. Each item of the request intention of the response information DB 82 corresponds to the identification result of the intention identification model 81.
  • the response information of the response information DB 82 is text information of a message that responds to the intention of the request as a voice output of the smart speaker 20 or a text output by a chatbot. Since the response to the request differs depending on the accommodation facility, the response information DB 82 is provided for each accommodation facility.
  • the response information DB 82 shown in FIG. 9 is in Japanese, but a response information DB 82 corresponding to another language such as English or Chinese may be provided as needed.
  • the example shown in FIG. 9 is an example of the request intention and response information of the response information DB 82 of the “Example hotel” as an accommodation facility.
  • the request given from the smart speaker 20 to the management server 30 via the voice processing server 50 and the request given from the smartphone 40 to the management server 30 via the translation server 60 are used to identify accommodation facilities, guest rooms, and the like. Identification information is attached as, for example, header information.
  • the management server 30 has response information DB 82 for a plurality of accommodation facilities, and can use response information DB 82 suitable for the accommodation facility according to the identification information attached to the request.
  • the intention and response information of the requested request shown in the figure is an example, and is not limited to this.
  • FIG. 10 is a schematic diagram showing a configuration example of the operator DB 83.
  • the operator DB 83 is a database in which conditions relating to the date and time and information relating to the operator to be switched to the manned response are stored in association with each other.
  • the operator DB 83 is provided for each accommodation facility. In the illustrated example, it is set that the operator of the operator center makes a manned response to the date and time from 8:00 am to 8:00 pm on weekdays and from 6:00 am to 10:00 pm on holidays. Also, for dates and times on weekdays from midnight to 8 am and 8 pm to 12 pm and on holidays from midnight to 6 am and from 10 pm to 12 pm The front operator is set to provide a manned response.
  • the management server 30 When switching to the manned response, the management server 30 refers to the operator DB 83 based on the date and time information and determines the operator to perform the manned response. The management server 30 transmits a request for a manned response to the operator terminal 70 used by the determined operator.
  • the operator center, the hotel front, and the like are set as operators, but in reality, the identification information and the like of the operator terminal 70 provided in these may be set.
  • the operator DB 83 may store information about the language that the operator can handle.
  • the management server 30 can make the operator who can handle the language used by the user perform a manned response.
  • the management server 30 may allow the user to select the language used by the user, for example, when switching to a manned response, or determines the user's language from information such as a guest list of accommodation facilities. May be good.
  • the processing unit 31 reads and executes the server program 32a stored in the storage unit 32, so that the request acquisition unit 31a, the response processing unit 31b, the switching processing unit 31c, and the learning unit 31c are executed.
  • the processing unit 31d and the like are realized as software-like functional blocks.
  • the request acquisition unit 31a is a process in which the communication unit 33 communicates with the voice processing server 50 so that the user inputs voice to the smart speaker 20 and the voice processing server 50 acquires a request converted into text information. I do.
  • the request acquisition unit 31a is a process in which the communication unit 33 communicates with the translation server 60 so that the user inputs to the smartphone 40 and the translation server 60 acquires a request translated into an appropriate language. I do.
  • the request information acquired by the request acquisition unit 31a is all text information.
  • the response processing unit 31b responds to the request acquired by the request acquisition unit 31a by using the intention identification model 81 and the response information DB 82 stored in the storage unit 32.
  • the response processing unit 31b inputs the text information of the request acquired by the request acquisition unit 31a into the intention identification model 81, and acquires the intention identification result output by the intention identification model 81.
  • the response processing unit 31b refers to the response information DB 82 based on the intention of the acquired request, and acquires the response information corresponding to the intention.
  • the response processing unit 31b transmits the acquired response information to the voice processing server 50 or the translation server 60, and makes a response by the voice output of the smart speaker 20 or a response by the text output of the smartphone 40.
  • the switching processing unit 31c switches from a voice response using the smart speaker 20 to a chatbot using the smartphone 40 when an appropriate response cannot be made to the user's request acquired by the request acquisition unit 31a. Or, switch from a chatbot to a manned response.
  • the switching processing unit 31c for example, when the intention identification model 81 cannot identify the intention with respect to the text information of the request acquired by the request acquisition unit 31a, or the response information corresponding to the identified intention is transmitted from the response information DB 82. If it cannot be obtained, switch it.
  • the switching processing unit 31c When the switching processing unit 31c cannot respond to the request given by the voice processing server 50, the switching processing unit 31c transmits an instruction prompting the smart speaker 20 that is the source of this request to switch to the chatbot. To do.
  • the instruction given from the management server 30 to the smart speaker 20 via the voice processing server 50 includes an image of the access information 20a necessary for accessing the chatbot.
  • the access information 20a is, for example, two-dimensionally coded information of a URL for accessing a chatbot and identification information of an accommodation facility and a guest room in which the smart speaker 20 is installed.
  • the smart speaker 20 Upon receiving this instruction, the smart speaker 20 asks the user whether or not to switch to the chatbot, and displays the access information 20a when a reply to the effect of switching is obtained.
  • the switching processing unit 31c When the switching processing unit 31c cannot respond to the request given by the translation server 60, the switching processing unit 31c transmits an instruction prompting the switching to the manned response to the smartphone 40 that is the transmission source of this request.
  • the smartphone 40 inquires the user whether or not to switch from the chatbot to the manned response, and when a reply to the effect of switching is obtained, sends a switching request to the management server 30.
  • the switching processing unit 31c of the management server 30 acquires information on the date and time at that time, refers to the operator DB 83 stored in the storage unit 32, and determines the operator to be switched to.
  • the management server 30 may determine the operator to switch to in consideration of the languages that the operator can handle.
  • the switching processing unit 31c transmits an instruction to perform a manned response to the operator terminal 70 used by the determined operator. After that, the management server 30 relays text information related to the user's request and the operator's response between the smartphone 40 and the operator terminal 70.
  • the learning processing unit 31d performs a process of learning (re-learning) the intention identification model 81.
  • the learning processing unit 31d requests the operator to input teacher data after, for example, the switching processing unit 31c performs the above-mentioned switching and finally the manned response by the operator is completed.
  • the learning processing unit 31d displays the text information related to the user's request that the response processing unit 31b could not respond to on the operator terminal 70, and the request that the operator actually determines with respect to this text information.
  • Ask for intent input The learning processing unit 31d acquires information obtained by combining the text information of the user's request and the intention input by the operator as teacher data from the operator terminal 70 and stores it in the storage unit 32.
  • the learning processing unit 31d relearns the intention identification model 81 stored in the storage unit 32 using the newly stored teacher data when a predetermined amount of teacher data is accumulated or each time the teacher data is acquired. To do.
  • the re-learning of the intention identification model 81 may be performed by another server device, and the management server 30 may only store the teacher data.
  • FIG. 11 is a block diagram showing the configuration of the smartphone 40 according to the present embodiment.
  • the smartphone 40 according to the present embodiment includes a processing unit (processor) 41, a storage unit (storage) 42, a communication unit (transceiver) 43, a touch panel 44, a camera 45, and the like.
  • the processing unit 41 is configured by using an arithmetic processing unit such as a CPU or MPU.
  • the processing unit 41 reads and executes the program 42a stored in the storage unit 42 to perform various processing such as a processing for accessing the management server 30 and a processing for realizing a chat.
  • the storage unit 32 is configured by using a non-volatile memory element such as a flash memory.
  • the storage unit 42 stores various programs executed by the processing unit 41 and various data required for processing by the processing unit 41.
  • the storage unit 42 stores the program 42a executed by the processing unit 41.
  • the program 42a may be written to the storage unit 42, for example, at the manufacturing stage of the smartphone 40.
  • the smartphone 40 may acquire what is distributed by a remote server device or the like by communication.
  • the smartphone 40 may read the program 42a recorded on a recording medium such as a memory card or an optical disk and store it in the storage unit 42.
  • the writing device may read out what is recorded on the recording medium and write it in the storage unit 42 of the smartphone 40.
  • the program 42a may be provided in a mode of distribution via a network, or may be provided in a mode recorded on a recording medium.
  • the communication unit 43 can communicate with various devices via the network N including the Internet, wireless LAN, mobile phone communication network, and the like.
  • the communication unit 43 communicates with the translation server 60 via the network N.
  • the communication unit 43 transmits the data given by the processing unit 41 to the translation server 60, and gives the data received from the translation server 60 to the processing unit 41.
  • the touch panel 44 is one of the user interfaces included in the smartphone 40, and has a display unit and an input unit.
  • the display unit of the touch panel 44 is configured by using a liquid crystal display or the like, and displays various images, characters, and the like based on the processing of the processing unit 41.
  • the input unit of the touch panel 44 is provided with a sensor for detecting contact by the user on the surface of the display unit, and receives and accepts a touch (tap) operation or the like for an image or the like displayed on the display unit as input. Is notified to the processing unit 41.
  • the camera 45 takes an image of the surroundings and gives the captured image information to the processing unit 41.
  • the camera 45 of the smartphone 40 is used for capturing the access information 20a displayed on the touch panel 25 of the smart speaker 20.
  • the processing unit 41 reads and executes the program 42a stored in the storage unit 42, so that the access information acquisition unit 41a, the chat processing unit 41b, and the like are software-like functional blocks. Is realized as.
  • the access information acquisition unit 41a performs a process of acquiring the access information 20a of the two-dimensional code displayed on the touch panel 25 by the smart speaker 20. For example, the access information acquisition unit 41a displays a message prompting the image of the access information 20a displayed on the smart speaker 20 on the touch panel 44, and acquires an image of the access information 20a imaged by the camera 45. Based on the acquired image, the access information acquisition unit 41a acquires the URL for accessing the chatbot included in the access information 20a, the identification information of the accommodation facility and the guest room where the smart speaker 20 is installed, and the like. To do.
  • the chat processing unit 41b accesses the management server 30 that provides the chatbot (via the translation server 60) based on the access information acquired by the access information acquisition unit 41a.
  • the chat processing unit 41b displays the language selection box shown in FIG. 4 and accepts the selection of the language used for chat, and then displays the chat screen shown in FIG.
  • the chat processing unit 41b acquires the text information of the request input by the user, and transmits the acquired text information to the management server 30 via the translation server 60. Further, the chat processing unit 41b receives the text information of the response transmitted from the management server 30 via the translation server 60, and displays the received text information on the chat screen.
  • the chat processing unit 41b can proceed with the chat by the same processing regardless of whether the chat partner is an automatic response by the chatbot or a manned response of the operator.
  • ⁇ Response switching process> In the information providing system according to the present embodiment, three types of information providing methods are provided: a voice response using the smart speaker 20, an automatic response by a chatbot using the smartphone 40, and a manned response by the operator using the smartphone 40. It is provided to users. Further, in the information providing system according to the present embodiment, the management server 30 determines the determination of switching from the voice response of the smart speaker 20 to the automatic response of the chatbot and switching from the automatic response of the chatbot to the manned response of the operator. By doing this and encouraging the user to switch the information provision method, smooth switching of the information provision method is realized.
  • the voice processing server 50 transmits Japanese text information to the management server 30, and if it is English, the voice processing server 50 transmits English text information to the management server 30.
  • the voice processing server 50 sends a notification to the management server 30 that the voice information cannot be converted into text information.
  • the management server 30 acquires the response information to the user's request using the intention identification model 81 and the response information DB 82 based on the text information received from the voice processing server 50.
  • the management server 30 transmits this response information as text information to the voice processing server 50, and the voice processing server 50 converts the text information into voice information and transmits it to the smart speaker 20.
  • the smart speaker 20 receives the voice information from the voice processing server 50, and outputs the received voice information from the speaker 27 as a response to the user's request.
  • the management server 30 when the management server 30 cannot acquire the response information even by using the intention identification model 81 and the response information DB 82 based on the text information received from the voice processing server 50, the management server 30 changes from the voice response to the automatic response of the chatbot.
  • a switching instruction prompting the switching is transmitted to the smart speaker 20 via the voice processing server 50.
  • the management server 30 generates the access information 20a of the two-dimensional code including the information such as the URL for accessing the chatbot and the identification information of the accommodation facility and the guest room where the smart speaker 20 is installed. Send along with the switching instruction.
  • the reason is that the voice processing server 50 cannot convert the voice information into the text information because the voice input to the smart speaker 20 is unclear.
  • the voice processing server 50 gives a notification to the management server 30 that the voice processing was not possible, and it can be determined that the management server 30 cannot acquire the response information by this notification.
  • the intention identification model 81 cannot identify the intention of the request.
  • the intention identification model 81 is configured to output the intention of the identified request and the certainty of the identification result for the input request, when the output certainty does not exceed a predetermined threshold value.
  • the management server 30 can determine that the intention identification model 81 cannot identify the intention and cannot acquire the response information.
  • the reason is that the intention of the request identified by the intention identification model 81 and the response information corresponding thereto are not registered in the response information DB 82.
  • the management server 30 searches the response information DB 82 based on the intention of the request output by the intention identification model 81, and can determine that the response information cannot be acquired when the corresponding item does not exist in the response information DB 82.
  • the method of determining that the management server 30 cannot acquire the response information corresponding to the request is not limited to the above method, and various methods can be adopted.
  • the smart speaker 20 that receives the switching instruction from the management server 30 via the voice processing server 50 outputs a message or the like inquiring whether to switch to the automatic response of the chatbot by voice from the speaker 27 or the touch panel 25. Is displayed in, and the user inquires whether or not switching is possible. When a response affirming the switching is obtained from the user by voice input or an operation on the touch panel, the smart speaker 20 displays the two-dimensional code of the access information 20a attached to the switching instruction on the touch panel 25.
  • the smartphone 40 displays the language selection box shown in FIG. 4 and allows the user to select the language to be used in the chat.
  • the smartphone 40 displays a chat screen in the selected language.
  • the smartphone 40 accepts the text input of the request by the user and outputs the text of the response by the management server 30.
  • the smartphone 40 reads the access information 20a by an application program or the like that reads a two-dimensional code, and accesses the URL included in the read access information 20a by a program such as a browser. It shall be.
  • the chat screen is displayed on the browser, and the process for displaying the chat screen is performed by the cooperation of the management server 30 and the smartphone 40.
  • the management server 30 may automatically select a language based on, for example, the identification information of the accommodation facility and the guest room included in the access information 20a. For example, the management server 30 acquires the identification information of the accommodation facility and the guest room from the smartphone 40, and stays in the guest room of the accommodation facility identified by the identification information from another server device or the like that manages the guests of the accommodation facility. By acquiring information such as the nationality of the user, the language used for chatting can be selected.
  • FIG. 12 is a flowchart showing a procedure of processing performed by the smart speaker 20 according to the present embodiment.
  • the processing unit 21 of the smart speaker 20 according to the present embodiment determines whether or not the voice input of the request by the user to the microphone 26 has been made (step S1). When no voice input is made (S1: NO), the processing unit 21 waits until the request voice input is made. When voice input is made (S1: YES), the processing unit 21 acquires the voice input by the microphone 26 as voice information of digital data (step S2). The processing unit 21 transmits the acquired voice information to the voice processing server 50 by the communication unit 23 (step S3).
  • the processing unit 21 determines whether or not a response to the transmitted request has been received (step S4). If no response has been received (S4: NO), the processing unit 21 waits until the response is received.
  • the response received by the smart speaker 20 is voice response information to the request or an instruction to switch to the chatbot.
  • the processing unit 21 determines whether or not the received response is an instruction to switch to the chatbot (step S5). If it is not a switching instruction (S5: NO), the processing unit 21 outputs the voice information of the received response from the speaker 27 (step S6), and ends the processing.
  • the processing unit 21 inquires the user whether or not the switching is possible by outputting a voice from the speaker 27 or displaying a message or the like inquiring whether or not the switching is possible on the touch panel 25. (Step S7).
  • the processing unit 21 determines whether or not an answer affirming the switching has been obtained in response to this inquiry (step S8).
  • the processing unit 21 displays the two-dimensional code access information 20a given from the management server 30 together with the switching instruction on the touch panel 25 (step S9), and processes the information. To finish. If no answer affirming the switching is obtained (S8: NO), the processing unit 21 ends the process without displaying the access information 20a.
  • FIG. 13 is a flowchart showing the procedure of the switching control process performed by the management server 30 according to the present embodiment.
  • the request acquisition unit 31a of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not the communication unit 33 has received the request based on the text information from the voice processing server 50 (step S21). If the request has not been received (S21: NO), the request acquisition unit 31a waits until the request is received.
  • the response processing unit 31b of the processing unit 31 inputs the text information of the received request into the intention identification model 81 stored in the storage unit 32 (step S22).
  • the response processing unit 31b acquires the identification result of the intention of the request output by the intention identification model 81 (step S23). Based on the acquired identification result, the response processing unit 31b determines whether or not the intention of the request can be identified (step S24).
  • the response processing unit 31b refers to the response information DB 82 stored in the storage unit 32 based on the identified intention (step S25).
  • the response processing unit 31b refers to the response information DB 82 and determines whether or not the response information has been obtained (step S26). When the response information is obtained (S26: YES), the response processing unit 31b transmits the response information acquired from the response information DB 82 to the smart speaker 20 of the request transmission source via the voice processing server 50 (step S27). , End the process.
  • the processing unit 31 When the intention of the request cannot be identified by the intention identification model 81 (S24: NO), or when the response information for the intention of the request cannot be obtained from the response information DB 82 (S26: NO), the processing unit 31 The switching processing unit 31c of the above generates access information 20a for accessing the chatbot (step S28). The switching processing unit 31c transmits a switching instruction including the generated access information 20a to the smart speaker 20 of the request transmission source via the voice processing server 50 (step S29), and ends the processing.
  • FIG. 14 is a flowchart showing a procedure of chat start processing performed by the smartphone 40 according to the present embodiment.
  • the processing unit 41 of the smartphone 40 according to the present embodiment starts an application program in response to, for example, a user operation, and shifts to a mode for reading a two-dimensional code (step S41).
  • the access information acquisition unit 41a of the processing unit 41 acquires the access information 20a by capturing the access information 20a with the camera 45 (step S42).
  • the access information acquisition unit 41a acquires the URL and the identification information included in the acquired access information 20a (step S43).
  • the chat processing unit 41b of the processing unit 41 accesses the management server 30 based on the acquired URL and the identification information (step S44).
  • the management server 30 can determine which accommodation facility or guest room user is accessing. it can.
  • the chat processing unit 41b displays a language selection box and accepts the selection of the language used for chat (step S45).
  • the chat processing unit 41b starts a chat process with the chatbot provided by the management server 30 in the selected language (step S46), and ends the process.
  • the user who has started the chat using the smartphone 40 can input the request by text input.
  • the text information of the input request is transmitted from the smartphone 40 to the translation server 60, and the text information is translated by the translation server 60 as necessary.
  • the translation server 60 transmits the translated text information or the text information that does not need to be translated to the management server 30. If translation of the text information is not required, the text information may be directly transmitted from the smartphone 40 to the management server 30.
  • the management server 30 acquires the response information to the user's request using the intention identification model 81 and the response information DB 82 based on the text information received from the translation server 60.
  • the management server 30 transmits this response information as text information to the translation server 60, and the translation server 60 translates the text information as needed.
  • the translation server 60 transmits the translated text information or the text information that does not need to be translated to the smartphone 40.
  • the smartphone 40 receives the text information from the translation server 60 and displays the received text information on the chat screen as a response to the user's request.
  • the management server 30 when the management server 30 cannot acquire the response information even by using the intention identification model 81 and the response information DB 82 based on the text information received from the translation server 60, the management server 30 makes a manned response of the operator from the automatic response of the chatbot. A switching instruction prompting the user to switch to is transmitted to the smartphone 40 via the translation server 60.
  • the management server 30 may acquire response information. For example, the fact that the intention identification model 81 cannot identify the intention of the request can be mentioned as a factor. Further, for example, the reason is that the intention of the request identified by the intention identification model 81 and the response information corresponding thereto are not registered in the response information DB 82. Various methods can be adopted as a method for determining that the management server 30 cannot acquire the response information corresponding to the request.
  • the smartphone 40 which receives the switching instruction from the management server 30 via the translation server 60, displays a message or the like inquiring whether or not to switch to the manned response of the operator on the chat screen, and inquires whether or not the user can switch. .. In response to this inquiry, the user can switch to a manned response by, for example, touching the switching icon provided on the chat screen. When the user gives an affirmative answer to the switch, the smartphone 40 transmits a request for switching to the manned response to the management server 30 via the translation server 60.
  • the management server 30 that has received the switching request from the smartphone 40 acquires the date and time information at that time, and refers to the operator DB 83 stored in the storage unit 32 based on the acquired date and time information.
  • the management server 30 determines an operator requesting a manned response based on the date and time information and the operator DB 83, and transmits an instruction to perform the manned response to the operator terminal 70 used by the determined operator.
  • the management server 30 may determine the operator to switch to in consideration of the languages that the operator can handle.
  • the management server 30 transmits information such as the history and the user so far and displays it on the operator terminal 70 before the operator starts to respond to the manned response.
  • the history so far may include, for example, text information indicating the content of the voice input / output by the smart speaker 20, text information input / output by the automatic response of the chatbot, and the like.
  • the management server 30 stores this information as history information in the storage unit 32.
  • the information about the user can be, for example, information such as the gender, age, and nationality of the user, and is acquired from another server device or the like that stores information about the guest of the accommodation facility. By displaying this information on the operator terminal 70 in advance, the operator who makes a manned response can smoothly start chatting with the user.
  • the management server 30 relays the transmission and reception of text information between the smartphone 40 and the operator terminal 70 to establish a chat using the text information of the user and the operator.
  • the management server 30 requests the operator who made the manned response to create teacher data for re-learning the intention identification model 81.
  • the management server 30 inquires, for example, the operator terminal 70 that has completed the manned response to the user what the intention of the user's request was.
  • the management server 30 may transmit the text information input by the user in the voice response or the automatic response of the chatbot before the manned response to the operator terminal 70, and display the text information on the operator terminal 70. ..
  • the operator Based on the message or the like displayed on the operator terminal 70, the operator inputs what the intention of the request of the user who made the manned response was.
  • the operator terminal 70 transmits the information input by the operator to the management server 30.
  • the management server 30 receives the information from the operator terminal 70, creates teacher data in which the text information of the request related to the user's input and the intention of the request input by the operator are associated with each other, and stores the teacher data in the storage unit 32.
  • the information input by the operator at the operator terminal 70 can be used not only for learning the intention identification model 81, but also for adding or modifying the response information to the response information DB 82.
  • FIG. 15 is a flowchart showing a procedure of chat processing performed by the smartphone 40 according to the present embodiment.
  • the chat processing unit 41b of the processing unit 41 of the smartphone 40 according to the present embodiment accepts the input of the request by the user based on the operation on the touch panel 44 (step S51).
  • the chat processing unit 41b transmits the text information of the received request to the translation server 60 by the communication unit 43 (step S52).
  • the chat processing unit 41b determines whether or not a response to the transmitted request has been received (step S53). If no response has been received (S53: NO), the chat processing unit 41b waits until the response is received.
  • the response received by the smartphone 40 is text information of the response to the request or an instruction to switch to the manned response by the operator.
  • the chat processing unit 41b determines whether or not the received response is an instruction to switch to the manned response (step S54). If it is not a switching instruction (S54: NO), the chat processing unit 41b displays the text information of the received response on the chat screen (step S55), and ends the process.
  • the chat processing unit 41b inquires the user whether or not the switching is possible by displaying a message inquiring whether or not the switching is possible (step S56). The chat processing unit 41b determines whether or not an answer affirming the switching has been obtained for this inquiry (step S57). When an answer affirming the switching is obtained (S57: YES), the chat processing unit 41b transmits a request for switching to the manned response to the management server 30 via the translation server 60 (step S58), and performs processing. finish. If no answer affirming the switch is obtained (S57: NO), the chat processing unit 41b ends the process without transmitting the switch request.
  • FIG. 16 is a flowchart showing a procedure of switching control processing performed by the management server 30 according to the present embodiment.
  • the request acquisition unit 31a of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not the communication unit 33 has received the request based on the text information from the translation server 60 (step S71). If the request has not been received (S71: NO), the request acquisition unit 31a waits until the request is received.
  • the response processing unit 31b of the processing unit 31 inputs the text information of the received request into the intention identification model 81 stored in the storage unit 32 (step S72).
  • the response processing unit 31b acquires the identification result of the intention of the request output by the intention identification model 81 (step S73).
  • the response processing unit 31b determines whether or not the intention of the request can be identified (step S74).
  • the response processing unit 31b refers to the response information DB 82 stored in the storage unit 32 based on the identified intention (step S75).
  • the response processing unit 31b refers to the response information DB 82 and determines whether or not the response information has been obtained (step S76). When the response information is obtained (S76: YES), the response processing unit 31b transmits the response information acquired from the response information DB 82 to the request transmission source smartphone 40 via the translation server 60 (step S77), and processes the response information. To finish.
  • the processing unit 31 When the intention of the request cannot be identified by the intention identification model 81 (S74: NO), or when the response information for the intention of the request cannot be obtained from the response information DB 82 (S76: NO), the processing unit 31 The switching processing unit 31c of the above transmits a switching instruction from the automatic response of the chatbot to the manned response by the operator to the smartphone 40 of the request transmission source via the voice processing server 50 (step S78), and ends the processing.
  • FIG. 17 is a flowchart showing the procedure of the switching process to the manned response performed by the management server 30 according to the present embodiment.
  • the switching processing unit 31c of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not a request for switching to a manned response from the smartphone 40 has been received via the translation server 60 (step S91). When the request for switching to the manned response has not been received (S91: NO), the switching processing unit 31c waits until the request is received.
  • the switching processing unit 31c acquires information on the date and time at that time (step S92).
  • the date and time information can be obtained from, for example, an operating system running on the management server 30.
  • the switching processing unit 31c refers to the operator DB 83 stored in the storage unit 32 based on the acquired date and time information (step S93).
  • the switching processing unit 31c determines an operator to perform a manned response based on the date and time information and the operator DB 83 (step S94).
  • the switching processing unit 31c gives an instruction to give a response to the operator terminal 70 used by the determined operator (step S95).
  • the processing unit 41 transmits the user's request from the smartphone 40 to the operator terminal 70, and the operator's response from the operator terminal 70 is transmitted to the smartphone 40 to relay the request and the response (step S96). ..
  • the processing unit 41 determines whether or not the manned response is completed (step S97). When the manned response is not completed (S97: NO), the processing unit 41 returns the processing to step S96 and continuously relays the request and the response.
  • the processing unit 41 inquires of the operator terminal 70 that has performed the manned response the intention of the user's request (step S98). Based on the operator's response to this inquiry, the processing unit 41 generates teacher data in which the text information of the request and the intention of the request are associated with each other and stores it in the storage unit 32 (step S99), and ends the processing.
  • the learning (re-learning) process of the intention identification model 81 of the management server 30 is performed using the teacher data created as a result of the manned response by the operator.
  • the re-learning of the intention identification model 81 is performed at a periodic timing such as once a month, or at a timing when the accumulated amount of teacher data exceeds a predetermined amount.
  • FIG. 18 is a flowchart showing a procedure of learning processing of the intention identification model 81 performed by the management server 30 according to the present embodiment.
  • the learning processing unit 31d of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not the timing for re-learning the intention identification model 81 has been reached (step S111). When the timing for re-learning has not been reached (S111: NO), the learning processing unit 31d waits until the timing for re-learning is reached.
  • the learning processing unit 31d reads out the learning data stored in the storage unit 32 (step S112). Further, the learning processing unit 31d reads out the intention identification model 81 stored in the storage unit 32. The learning processing unit 31d performs a process of learning the intention identification model 81 using the read learning data (step S114). The learning processing unit 31d stores the learned intention identification model 81 that has completed the learning processing in the storage unit 32 (step S115), and ends the processing.
  • FIG. 19 is a schematic diagram for explaining information provision before accommodation by the information provision system according to the present embodiment.
  • the user can make an accommodation reservation for the accommodation facility to the reservation server 80. After the reservation of the accommodation by the user is completed, the reservation server 80 transmits the access information for accessing the information providing system that provides the information of the accommodation facility to the reserved user.
  • the access information from the reservation server 80 to the user can be transmitted, for example, by sending an e-mail to the e-mail address registered as the information related to the user.
  • a URL for accessing the information providing system or the management server 30, identification information of the reserved accommodation facility, and the like may be attached to the e-mail to the user as access information.
  • the e-mail transmitted by the reservation server 80 is received by, for example, the user's smartphone 40, and the user accesses the information providing system on the smartphone 40 based on the access information attached to the e-mail.
  • the user can use the information provision by the automatic response of the chatbot provided by the management server 30.
  • an image of a two-dimensional code of access information may be attached to the e-mail.
  • the user displays the two-dimensional code on the smartphone 40, reads the two-dimensional code using the camera 28 with his / her own smart speaker 20 installed at home, and accesses the information providing system with the smart speaker 20. You may. In this case, the user can use the information provision by the voice response provided by the management server 30 on the smart speaker 20.
  • the transmission of the access information from the reservation server 80 to the user may be by mail.
  • a letter or card printed with a two-dimensional code of access information is enclosed in the accommodation facility information mailed to the user.
  • the user can read the two-dimensional code with his / her own smartphone 40 or smart speaker 20 and use the information providing system with his / her own smartphone 40 or smart speaker 20.
  • the management server 30 of the information providing system can make a similar response to a request from a user regardless of whether the guest is a guest or a reservation guest. However, the management server 30 may set a limit on the information given as a response to the reserved guest user. Further, the information providing system may provide information to users other than the guest and the reserved guest, for example, a user who is considering staying. In this case, the user can use the information providing system by accessing the website of the accommodation facility or the like using his / her own smartphone 40 or the smart speaker 20 and issuing a request such as an inquiry.
  • the smart speaker 20 installed in the guest room of the accommodation facility acquires a request based on the voice input from the user, and the response information to the acquired request is obtained by the smart speaker 20. Output audio with.
  • the management server 30 switches to the automatic response by text input / output.
  • the response information for the request cannot be output
  • the response information corresponding to the user's request cannot be obtained from the response information DB 82 and an appropriate response cannot be performed for the request
  • the request is made. This is the case when it is not possible to make a unique response for each.
  • the user's request includes the case where the intention cannot be identified by the intention identification model 81, or the case where the response information corresponding to the response information DB 82 is not stored.
  • a predetermined message such as "Cannot respond to the request” may be uniformly responded to the user's request, for example, when the intention of the request cannot be identified.
  • Such a response is not a unique response for each request and is not an appropriate response to the request. Even when such a response is performed, in the present embodiment, it is included in the case where the response information for the request cannot be output.
  • the management server 30 When switching from voice input / output to text input / output, the management server 30 displays access information 20a for accessing the automatic response by text input / output on the touch panel 25 of the smart speaker 20. As a result, the user can easily switch from voice input to text input / output by using the access information 20a displayed on the smart speaker 20.
  • the management server 30 receives access from the smartphone 40 based on the access information 20a displayed by the smart speaker 20, acquires a request based on the text input to the smartphone 40, and responds to the acquired request.
  • the response information is output to the smartphone 40 as a text.
  • the user can input text using his / her familiar smartphone 40 and obtain a response to the request.
  • the management server 30 switches to a manned response by the operator when the response information cannot be output in response to the request based on the input of the text information. As a result, the operator can reliably respond to the user's request that cannot be handled by the automatic response.
  • the management server 30 stores the operator's response according to the date and time in the operator DB 83, refers to the operator DB 83 based on the date and time information for switching, and is the operator to which the manned response is switched. To determine. As a result, the management server 30 can appropriately determine the switching destination of the manned response according to, for example, the working time of the operator center, and can reliably perform the manned response to the request from the user.
  • the management server 30 uses an intention identification model 81 learned to receive information related to a user's request as input, identify the intention of the request, and output the information.
  • the management server 30 acquires the information related to the user's request and inputs it to the intention identification model 81, acquires the intention of the request output by the intention identification model 81, and inputs the response information to the acquired request intention to the smart speaker 20 or the smart speaker 20. Output to the smartphone 40.
  • the management server 30 relearns the intention identification model 81 based on the result of the manned response. As a result, the discriminative ability of the intention identification model 81 can be improved, and the accuracy of the automatic response to the user's request can be improved.
  • the management server 30 stores the response information for the request in the response information DB 82, and the response information DB 82 is provided for each accommodation facility.
  • the smart speaker 20 and the smartphone 40 transmit identification information for identifying the accommodation facility and the guest room together with the request input by the user.
  • the management server 30 uses the response information DB 82 corresponding to the accommodation facility based on the identification information received together with the request from the user, and acquires the response information for the request.
  • the management server 30 outputs the acquired response information from the smart speaker 20 or the smartphone 40. As a result, the management server 30 can provide the user with response information suitable for the accommodation facility.
  • the information providing system also provides information to the user before staying.
  • the reservation server 80 transmits access information to the smartphone 40 of the user who has reserved accommodation at the accommodation facility.
  • the user accesses the information providing system with the smartphone 40 based on the access information, and the management server 30 accepts the access from the user's smartphone 40 based on the access information.
  • the information providing system can provide information not only to the users staying at the accommodation facility but also to the users before staying at the accommodation facility.
  • the user can acquire various information about the accommodation facility before staying and prepare for a trip or the like.
  • the user interface installed in the accommodation facility is the smart speaker 20, but the present invention is not limited to this.
  • the user interface may be any device as long as it can perform audio input / output by a microphone and a speaker.
  • a personal computer to which a microphone and a speaker are connected can be used as a user interface.
  • the terminal device used by the user is the smartphone 40, but the present invention is not limited to this.
  • the terminal device may be any device capable of input / output text, such as a personal computer, a tablet terminal device, or a mobile phone.
  • the user before switching from the voice response to the chatbot automatic response and switching from the chatbot automatic response to the operator's manned response, the user is inquired whether or not the switch is possible.
  • the management server 30 may automatically perform these switching without making an inquiry to the user.
  • the smart speaker 20 is configured to display the access information 20a as a two-dimensional code, but the present invention is not limited to this.
  • the access information 20a may be displayed by the smart speaker 20 as character information including information such as a URL, a login ID, and a password.
  • the output method of the access information 20a may be any.
  • a plurality of server devices of the management server 30, the voice processing server 50, the translation server 60, and the reservation server 80 share the processing to realize the information providing system.
  • these plurality of server devices may be realized as one server device, or the processing may be shared by, for example, five or more server devices.
  • the management server 30 may perform processing of the voice processing server 50 or the translation server 60.
  • the management server 30 may perform the processing of the reservation server 80.
  • the server device that holds and learns the intention identification model 81 may be a device different from the management server 30.
  • the management server 30 is configured to respond to the request by using the intention identification model 81 and the response information DB 82, but the present invention is not limited to this.
  • the management server 30 may use a trained model that accepts the input of the text information of the request and outputs the response information to the request.
  • the management server 30 may use a database in which the text information of the request and the response information are associated with each other without using the trained model. The management server 30 may respond to the request in any way.
  • the smart speaker 20 first makes a voice response to the user's request, and when it is determined that an appropriate voice response cannot be made, the chatbot switches to the response. If the chatbot further determines that an appropriate response cannot be made, the operator switches to a manned response. On the other hand, in the information providing system according to the second embodiment, when it is determined that the smart speaker 20 cannot perform an appropriate voice response, the user switches between a chatbot response and an operator manned response. Can be selected.
  • the user staying at the accommodation facility makes a request by utterance to the smart speaker 20 provided in the guest room, and the information of the response to the request is transmitted by the voice of the smart speaker 20. It can be obtained from the output.
  • the processing of the voice response performed by the smart speaker 20, the management server 30, the voice processing server 50, etc. of the information providing system may be the same as that described in the first embodiment.
  • the management server 30 When it is determined that the information cannot be provided by an appropriate voice response, for example, when the language spoken by the user is not included in the language supported by the voice response, the management server 30 according to the second embodiment is a chatbot. Prompts the user to switch to a response by or a manned response by the operator. At this time, the management server 30 displays the information for switching to the response by the chatbot and the information for switching to the manned response on the display unit 25a of the smart speaker 20, and allows the user to select which response to switch to. ..
  • FIG. 20 is a schematic diagram showing an example of access information displayed on the smart speaker 20 according to the second embodiment.
  • the smart speaker 20 displays on the touch panel 25 the access information 20a for switching to the response by the chatbot and the access information 20b for switching to the manned response of the call center. ..
  • the smart speaker 20 displays the message "Please read the code of the desired response method" at the top of the substantially circular touch panel 25, and below this message is for access for the chatbot.
  • the information 20a and the access information 20b for the call center are displayed side by side.
  • the display method of the access information 20a and 20b shown in FIG. 20 is an example, and is not limited to this.
  • the smart speaker 20 displays options such as a button or an icon that accepts a selection of either a chatbot or a call center on the touch panel 25, accepts a selection of a response method from the user based on a touch operation for the option, and selects and responds. Only the access information 20a and 20b corresponding to the method may be displayed on the touch panel 25. Further, for example, the smart speaker 20 may display information such as a telephone number for accessing the call center instead of the access information 20b.
  • the user can select the response method by reading either of the two access information 20a and 20b displayed on the smart speaker 20 with the camera of the smartphone 40.
  • the smartphone 40 acquires information such as a URL from the access information 20a, accesses the chatbot, and provides the user with a response service by the chatbot. .. Since the response by the chatbot is the same as that of the above-described embodiment, detailed description thereof will be omitted.
  • the smartphone 40 acquires information such as a telephone number or URL from the access information 20b and accesses the call center (that is, makes a call to the call center). ).
  • the operator and the user of the call center can perform voice communication (call) via the smartphone 40.
  • the operator and the user may not perform voice communication, but may chat with the operator via the operator terminal 70, the management server 30, and the like described in the above-described embodiment.
  • a plurality of operators corresponding to each language are resident in order to support a plurality of languages.
  • the smart speaker 20 determines a language used by the user, and displays access information 20b capable of performing voice communication with an operator corresponding to this language.
  • the language used by the user may be determined by the smart speaker 20, the management server 30, the voice processing server 50, or other devices.
  • the information providing system may perform language determination using one or more of these plurality of methods.
  • the language used by the user is determined based on the content of words, words, sentences, etc. that the user has uttered to the smart speaker 20.
  • the voice information related to the user's utterance is transmitted from the smart speaker 20 to the voice processing server 50, and the voice processing server 50 can convert the voice information into text information for the language to be processed. Therefore, the voice processing server 50 can determine whether or not the given voice information is the language to be processed, and if the voice information corresponds to a plurality of languages, which of the languages to be processed is the voice information. it can.
  • the voice processing server 50 may only determine the language without converting it into text information, and for example, the management server 30 may handle voice information that the voice processing server 50 cannot process.
  • the management server 30 may determine the language from the voice information, or the smart speaker 20 may determine the language from the voice information, for example.
  • a learning model in which machine learning is performed using teacher data in which voice information and its language type are associated can be used.
  • This learning model learns more languages than the language to be processed by the voice processing server 50, accepts voice information as input, and outputs information regarding the language type of the voice information.
  • the voice processing server 50, the management server 30, or the smart speaker 20 stores this learning model in advance, and can determine a language based on voice information for a language that is not a processing target of the voice processing server 50.
  • the language used by the user is determined based on the information stored in this database. be able to. For example, when information about a country or place such as a user's address or birthplace is stored in a database as a user's guest information, the language corresponding to this country or place is determined to be the language used by the user. Can be done.
  • a hotel management system called PMS Property Management System
  • PMS Processed Management System
  • the smart speaker 20 or the management server 30 can communicate with the PMS server, acquire information about the staying user, and determine the language used by this user.
  • the smart speaker 20 can exchange information with and from the smartphone 40 by using wireless communication such as NFC (Near Field Communication), Bluetooth (registered trademark), or wireless LAN.
  • the smartphone 40 is set in advance regarding the language used by the user, and the smart speaker 20 acquires the information of the language setting from the smartphone 40 by wireless communication to determine the language used by the user.
  • the smartphone 40 may make the determination instead of the smart speaker 20 determining the language. That is, the smartphone 40 that has read the access information 20b displayed on the smart speaker 20 determines the access destination according to the language set for itself when accessing the call center based on the access information 20b. You may.
  • the smart speaker 20 can receive a language selection operation from the user and determine the language used by the user. For example, the smart speaker 20 displays the same language selection box of the smartphone 40 shown in FIG. 4 on the touch panel 25, and accepts the language selection by the user on the touch panel 25. The smart speaker 20 may accept the language selection at an appropriate timing. The smart speaker 20 may accept language selection at the timing when the user first visits the accommodation facility and uses the smart speaker 20, for example, immediately before displaying the access information 20b for accessing the call center. Language selection may be accepted at the timing of.
  • any one of the above (1) to (4) is adopted or a plurality of devices are appropriately combined.
  • the smart speaker 20 displays the access information 20b according to the determination result of the language used on the touch panel 25.
  • a user who has read the access information 20b with the smartphone 40 can perform voice communication with the operator in the language used by the user by accessing the call center using the access information 20b.
  • the device such as the smart speaker 20, the management server 30, the smartphone 40, or the voice processing server 50 determines the language used by the user before displaying the access information 20b. It is a configuration, but it is not limited to this. For example, even if the access information 20b displayed by the smart speaker 20 is read by the smartphone 40 and the user accesses the call center with the smartphone 40 using the access information 20b, the language used by the user is determined. Good.
  • a device such as a server that manages a call center determines the language used by the user by adopting any one of (1) to (4) above or by appropriately combining a plurality of the above (1) to (4), and the determination result is obtained. Voice communication is performed between the operator and the user according to the above.
  • the access information 20b displayed by the smart speaker 20 may display different information depending on, for example, each accommodation facility, each room of the accommodation facility, each location of the accommodation facility, and the like. As a result, the call center can determine which accommodation facility or the like is inquiring according to the telephone number dialed by the smartphone 40. Further, the access information 20b may display different information depending on, for example, the time or day of the week, and different information depending on, for example, whether or not the call center is open or whether or not the operator is present. May be displayed.
  • FIG. 21 is a flowchart showing a procedure of processing performed by the smart speaker 20 according to the second embodiment.
  • the processing unit 21 of the smart speaker 20 according to the second embodiment transmits voice information related to a request from a user to the voice processing server 50, and outputs voice information received from the voice processing server 50 from the speaker 27. (Step S201).
  • the processing unit 21 determines whether or not a switching instruction for instructing switching from the voice response to the response by the chatbot or the response by the call center is given from the voice processing server 50 during the voice response processing (step). S202). When the switching instruction is not given (S202: NO), the processing unit 21 returns the processing to step S201 and continues the voice response processing.
  • step S202 When a switching instruction is given (S202: YES), the user is inquired about whether or not switching is possible by outputting a voice message or the like inquiring whether or not switching is possible from the speaker 27 or displaying it on the touch panel 25 (step S203).
  • the processing unit 21 determines whether or not an answer affirming the switching has been obtained in response to this inquiry (step S204). When an answer affirming the switching is not obtained (S204: NO), the processing unit 21 returns the processing to step S201 and continues the voice response processing.
  • the processing unit 21 determines the language used by the user (step S205).
  • the language determination may be performed at a timing earlier than this. Further, the language determination may be performed by a device other than the smart speaker 20, and in this case, the processing unit 21 of the smart speaker 20 directly or indirectly acquires the determination result from the device that has performed the language determination.
  • the processing unit 21 displays the access information 20a for switching to the response by the chatbot and the access information 20b for switching to the manned response by the call center on the touch panel 25. Is displayed (step S206), and the process ends.
  • FIG. 22 is a flowchart showing a procedure of voice communication start processing performed by the smartphone 40 according to the second embodiment.
  • the processing unit 41 of the smartphone 40 according to the second embodiment starts an application program in response to, for example, a user operation, and shifts to a mode for reading a two-dimensional code (step S211).
  • the access information acquisition unit 41a of the processing unit 41 captures the access information 20b displayed by the smart speaker 20 with the camera 45, so that the access information 20b for switching to the manned response by the call center is performed.
  • the access information acquisition unit 41a acquires information such as a telephone number included in the acquired access information 20b (step S213).
  • the processing unit 41 accesses the call center based on the acquired information such as the telephone number (step S214).
  • the processing unit 41 starts voice communication between the user and the call center (step S215), and ends the processing.
  • the smart speaker 20 installed in the guest room of the accommodation facility acquires a request based on the voice input from the user, and the response information to the acquired request is obtained by the smart speaker 20. Output audio with.
  • the management server 30 and the smart speaker 20 switch to either an automatic response by text input / output or a manned response by the operator.
  • the smart speaker 20 displays the access information 20a for switching to the automatic response by text input / output and the access information 20b for switching to the manned response by the operator, and tells the user which switching is to be performed. Let me choose. The user can select which response method to switch to by reading any of the access information 20a or 20b with his / her smartphone 40.
  • the user can select and switch between the response by the chatbot and the manned response by the operator when the desired result cannot be obtained by the voice response using the smart speaker 20. Since the user can switch to the response method according to his / her own preference, it can be expected to provide information suitable for the user.
  • the information providing system determines the type of language used by the user, and switches to a manned response by an operator who can respond in the determined language. As a result, the user can smoothly perform voice communication with the operator.
  • the information providing system has a configuration in which the access information 20b displayed by the smart speaker 20 is read by the smartphone 40, and the user performs voice communication with the call center operator using the smartphone 40. , Not limited to this.
  • the smart speaker 20 may access the call center and the smart speaker 20 may perform voice communication by the user and the operator.
  • the information providing system according to the second embodiment does not have to be provided yesterday for the operator's manned response by chat using the operator terminal 70.
  • the information providing system according to the second embodiment described above switches to either an automatic response by text input / output or a manned response by an operator when it is not possible to output response information to a request by voice. ing.
  • the information providing system according to the third embodiment first switches to the automatic response by text input / output when the response information for the request by voice cannot be output. This can be performed in the same procedure as the information providing system according to the first embodiment.
  • the information providing system according to the first embodiment described above switches to the operator's manned response by text input / output when the automatic response by text input / output cannot respond to the user's request.
  • the information providing system according to the third embodiment cannot respond to the user's request by the automatic response by the text input / output
  • the information providing system according to the first embodiment has the same manned response by the text input / output. The user accepts the option of switching to or switching to a voice manned response by the call center operator.
  • the management server 30 of the information providing system automatically responds to the chatbot when the intention of the request from the user cannot be identified or when the response information for the intention of the request cannot be obtained. Sends the operator's instruction to switch to the manned response to the user's smartphone 40. This corresponds to, for example, the process shown in step S78 of the flowchart shown in FIG.
  • the smartphone 40 of the information providing system determines whether or not a switching instruction has been received from the management server 30 while performing an automatic response of the chatbot by text input / output. This corresponds to, for example, the process shown in step S54 of the flowchart shown in FIG.
  • the smartphone 40 selects whether to switch to the manned response by the operator's text input / output or to the manned response by the call center operator's voice input / output. Accept. At this time, the smartphone 40 displays a message on the touch panel 44 to switch to a manned response, for example, a button labeled "chat (text input / output)", and a label "call center (voice input / output)". The buttons marked with are displayed side by side. By accepting a touch operation on any of the buttons, the smartphone 40 accepts the selection of a manned response by text input / output or a manned response by voice input / output.
  • the smartphone 40 When switching to a manned response by text input / output is selected, the smartphone 40 transmits a request for switching to a manned response by text input / output to the management server 30. This corresponds to, for example, the process shown in step S58 of the flowchart shown in FIG.
  • the management server 30 that has received the request from the smartphone 40 performs a manned response by text input / output, for example, by processing the flowchart shown in FIG.
  • the smartphone 40 When switching to the manned response by voice input / output is selected, the smartphone 40 according to the third embodiment transmits a request for switching to the manned response by voice input / output to the management server 30.
  • the management server 30 Upon receiving this request, the management server 30 performs, for example, a process of determining the language used by the user to determine a call center, an operator, or the like suitable for the user.
  • the management server 30 transmits information such as a telephone number for making a call with the determined operator to the smartphone 40, or performs a process of establishing a call connection between the operator and the smartphone 40.
  • the smartphone 40 receives information such as a telephone number from the management server 30, it accesses the call center based on the received information and starts voice communication with the operator.
  • the information providing system selects switching to a manned response by text input / output or a manned response by voice input / output when the automatic response by text input / output cannot respond to the user's request. Accepts from the user and switches to the selected manned response.
  • the user can switch to the manned response by the operator when the desired result cannot be obtained by the automatic response by the chatbot, and can switch to either the text input / output or the voice input / output manned response. You can choose whether to do it. Since the user can switch to the response method according to his / her own preference, it can be expected to provide information suitable for the user.
  • Information provision system 20 Smart speaker (user interface) 20a Access information 21 Processing unit 22 Storage unit 22a Program 22b Identification information 23 Communication unit 25 Touch panel 25a Display unit 25b Input unit 26 Microphone 27 Speaker 28 Camera 30 Management server (information providing device) 31 Processing unit 31a Request acquisition unit 31b Response processing unit 31c Switching processing unit 31d Learning processing unit 32 Storage unit 32a Server program 33 Communication unit 40 Smartphone (terminal device) 41 Processing unit 41a Access information acquisition unit 41b Chat processing unit 42 Storage unit 42a Program 43 Communication unit 44 Touch panel 45 Camera 50 Voice processing server 60 Translation server 70 Operator terminal 80 Reservation server 81 Intention identification model (identifier) 82 Response information DB 83 Operator DB 99 Recording medium

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Provided are an information providing method, an information providing system, an information providing device, and a computer program, which use voice and text input/output as appropriate to provide information. The information providing system according to the present embodiment comprises: a user interface installed in a lodging facility; a request acquisition unit that acquires a request based on a voice inputted to the user interface; a response information acquisition unit that acquires, from a database, response information corresponding to the request acquired by the request acquisition unit; a voice output processing unit that performs processing in which the user interface is caused to output, as voice, the response information acquired by the response information acquisition unit; and an information providing device having a processing unit that performs processing switching to an automatic response via text input/output if the response information cannot be acquired from the database.

Description

情報提供方法、情報提供システム、情報提供装置及びコンピュータプログラムInformation provision method, information provision system, information provision device and computer program
 本開示は、宿泊施設を利用するユーザに対して種々の情報を提供する情報提供方法、情報提供システム、情報提供装置及びコンピュータプログラムに関する。 This disclosure relates to an information providing method, an information providing system, an information providing device, and a computer program that provide various information to a user who uses an accommodation facility.
 装置に対してユーザが音声での入力を行うことが可能なシステムが普及し始めている。このようなシステムとして、例えばスマートスピーカ又はAI(Artificial Intelligence)スピーカ等と呼ばれる装置が家庭内に設置され始めている。ユーザは、スマートスピーカに対して音声での質問等を行うことにより、種々の情報をスマートスピーカら音声出力により得ることができる。またユーザは、スマートスピーカに対して音声での命令を行うことにより、家電製品の動作制御等を行うことも可能である。 Systems that allow users to input voice to devices are becoming widespread. As such a system, devices called, for example, smart speakers or AI (Artificial Intelligence) speakers have begun to be installed in homes. The user can obtain various information by voice output from the smart speaker by asking a question by voice to the smart speaker. The user can also control the operation of home appliances by issuing voice commands to the smart speaker.
 特許文献1においては、ユーザとロボットとの間で、自然言語による対話を成立させるコミュニケーションシステムが提案されている。ロボットは、マイク及びスピーカを有しており、マイクで取得したユーザの発話をサーバへ送信し、サーバから応答を受信し、応答をスピーカから音声出力する。サーバは、会話に用いられる問い及び回答の会話データベースを有し、この会話データベースを参照してユーザの発話に対する応答を生成し、ロボットへ送信する。 Patent Document 1 proposes a communication system that establishes a dialogue in natural language between a user and a robot. The robot has a microphone and a speaker, transmits the user's utterance acquired by the microphone to the server, receives a response from the server, and outputs the response by voice from the speaker. The server has a conversation database of questions and answers used for conversation, and refers to this conversation database to generate a response to a user's utterance and send it to a robot.
特開2017-191531号公報Japanese Unexamined Patent Publication No. 2017-191531
 音声入力が可能な装置が普及し始めているものの、現状の装置はユーザによる種々のリクエストの全てに対して的確な応答ができるわけではない。また音声入力は、テキスト入力と比較すると精度が低いという課題がある。 Although devices capable of voice input have begun to spread, the current devices cannot accurately respond to all various requests by users. Further, voice input has a problem that the accuracy is lower than that of text input.
 本開示は、斯かる事情に鑑みてなされたものであって、その目的とするところは、音声及びテキストの入出力を適宜に利用して情報を提供する情報提供方法、情報提供システム、情報提供装置及びコンピュータプログラムを提供することにある。 The present disclosure has been made in view of such circumstances, and the purpose of the present disclosure is to provide information by appropriately using input / output of voice and text, an information providing method, an information providing system, and information providing. To provide equipment and computer programs.
 本開示に係る情報提供方法は、宿泊施設に設置されたユーザインタフェースに入力された音声に基づくリクエストを取得し、取得したリクエストに対応する応答情報をデータベースから取得し、取得した応答情報を前記ユーザインタフェースに音声で出力させ、前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答へ切り替える。 The information providing method according to the present disclosure acquires a request based on the voice input to the user interface installed in the accommodation facility, acquires the response information corresponding to the acquired request from the database, and obtains the acquired response information from the user. The interface is output by voice, and when the response information cannot be obtained from the database, the response is switched to an automatic response by inputting / outputting text.
 本開示においては、宿泊施設に設置されたユーザインタフェースにてユーザからの音声入力に基づくリクエストを取得し、取得したリクエストに対する応答情報をユーザインタフェースにて音声で出力する。取得したリクエストに対する応答情報の出力を行うことができない場合には、テキストの入出力による自動応答への切り替えを行う。これによりユーザは、音声入出力からテキスト入出力への切り替えを容易に行うことが可能となる。 In the present disclosure, a request based on voice input from the user is acquired by the user interface installed in the accommodation facility, and the response information to the acquired request is output by voice in the user interface. If it is not possible to output the response information for the acquired request, switch to automatic response by inputting / outputting text. This allows the user to easily switch from voice input / output to text input / output.
 本開示による場合は、音声及びテキストの入出力を適宜に利用して、ユーザに対する情報の提供を行うことができる。 In the case of this disclosure, information can be provided to the user by appropriately using the input / output of voice and text.
本実施の形態に係る情報提供システムの概要を説明するための模式図である。It is a schematic diagram for demonstrating the outline of the information provision system which concerns on this Embodiment. 本実施の形態に係る情報提供システムの概要を説明するための模式図である。It is a schematic diagram for demonstrating the outline of the information provision system which concerns on this Embodiment. スマートスピーカに表示されるアクセス用情報の一例を示す模式図である。It is a schematic diagram which shows an example of access information displayed on a smart speaker. スマートフォンに表示されるチャット画面の一例を示す模式図である。It is a schematic diagram which shows an example of a chat screen displayed on a smartphone. スマートフォンに表示されるチャット画面の一例を示す模式図である。It is a schematic diagram which shows an example of a chat screen displayed on a smartphone. 本実施の形態に係るスマートスピーカの構成を示すブロック図である。It is a block diagram which shows the structure of the smart speaker which concerns on this embodiment. 本実施の形態に係る管理サーバの構成を示すブロック図である。It is a block diagram which shows the structure of the management server which concerns on this Embodiment. 意図識別モデルの一構成例を示す模式図である。It is a schematic diagram which shows one configuration example of an intention identification model. 応答情報DBの一構成例を示す模式図である。It is a schematic diagram which shows one configuration example of the response information DB. オペレータDBの一構成例を示す模式図である。It is a schematic diagram which shows one configuration example of an operator DB. 本実施の形態に係るスマートフォンの構成を示すブロック図である。It is a block diagram which shows the structure of the smartphone which concerns on this embodiment. 本実施の形態に係るスマートスピーカが行う処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process performed by the smart speaker which concerns on this embodiment. 本実施の形態に係る管理サーバが行う切替制御処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the switching control processing performed by the management server which concerns on this Embodiment. 本実施の形態に係るスマートフォンが行うチャット開始処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the chat start processing performed by the smartphone which concerns on this Embodiment. 本実施の形態に係るスマートフォンが行うチャット処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the chat processing performed by the smartphone which concerns on this Embodiment. 本実施の形態に係る管理サーバが行う切替制御処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the switching control processing performed by the management server which concerns on this Embodiment. 本実施の形態に係る管理サーバが行う有人応答への切替処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the switching process to the manned response performed by the management server which concerns on this Embodiment. 本実施の形態に係る管理サーバが行う意図識別モデルの学習処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the learning process of the intention identification model performed by the management server which concerns on this Embodiment. 本実施の形態に係る情報提供システムによる宿泊前の情報提供を説明するための模式図である。It is a schematic diagram for demonstrating the information provision before accommodation by the information provision system which concerns on this Embodiment. 実施の形態2に係るスマートスピーカに表示されるアクセス用情報の一例を示す模式図である。It is a schematic diagram which shows an example of the access information displayed on the smart speaker which concerns on Embodiment 2. FIG. 実施の形態2に係るスマートスピーカが行う処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the process performed by the smart speaker which concerns on Embodiment 2. FIG. 実施の形態2に係るスマートフォンが行う音声通信開始処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the voice communication start processing performed by the smartphone which concerns on Embodiment 2.
 本開示の実施形態に係る情報提供システムの具体例を、以下に図面を参照しつつ説明する。なお、本開示はこれらの例示に限定されるものではなく、請求の範囲によって示され、請求の範囲と均等の意味及び範囲内でのすべての変更が含まれることが意図される。 A specific example of the information providing system according to the embodiment of the present disclosure will be described below with reference to the drawings. It should be noted that the present disclosure is not limited to these examples, and is indicated by the scope of claims, and is intended to include all modifications within the meaning and scope equivalent to the scope of claims.
<システム概要>
 図1及び図2は、本実施の形態に係る情報提供システムの概要を説明するための模式図である。本実施の形態に係る情報提供システム10は、ホテルの客室、共用部又は民泊に用いる民家等の宿泊施設にユーザインタフェースとして設置されたスマートスピーカ20を用いてユーザに情報を提供することができる。図1に示すスマートスピーカ20は、略半球形状の筐体に、タッチパネル25、マイク26、スピーカ27及びカメラ28等が適宜に配置された構成である。ただしスマートスピーカ20の形状等は一例であってこれに限るものではなく、例えば円柱状又は直方体状等の種々の形状が採用され得る。また客室に設置されるユーザインタフェースはスマートスピーカ20に限らず、音声による入出力を行うことが可能な情報処理装置であれば、どのような装置であってもよい。
<System overview>
1 and 2 are schematic views for explaining the outline of the information providing system according to the present embodiment. The information providing system 10 according to the present embodiment can provide information to a user by using a smart speaker 20 installed as a user interface in an accommodation facility such as a hotel guest room, a common area, or a private house used for a private lodging. The smart speaker 20 shown in FIG. 1 has a configuration in which a touch panel 25, a microphone 26, a speaker 27, a camera 28, and the like are appropriately arranged in a substantially hemispherical housing. However, the shape of the smart speaker 20 is an example and is not limited to this, and various shapes such as a columnar shape or a rectangular parallelepiped shape can be adopted. The user interface installed in the guest room is not limited to the smart speaker 20, and may be any device as long as it is an information processing device capable of input / output by voice.
 客室に宿泊したユーザ(宿泊客)は、特定のキーワード(例えば「スタートサービス」等)の発話によりスマートスピーカ20を起動することができる。ユーザは、発話による音声入力によってスマートスピーカ20に種々のリクエストを与えることができ、スマートスピーカ20から出力される音声による応答情報により、例えばこの宿泊施設に関する情報などの様々な情報を得ることができる。図示の例では、ユーザはスマートスピーカ20に対して「チェックアウトの時間を教えて」と発話によるリクエストを与えている。なお本実施の形態においては、ユーザがスマートスピーカ20等に対して入力する種々の要求又は質問等をリクエストと呼ぶ。 A user (guest) staying in a guest room can activate the smart speaker 20 by speaking a specific keyword (for example, "start service"). The user can give various requests to the smart speaker 20 by voice input by utterance, and various information such as information about this accommodation facility can be obtained from the voice response information output from the smart speaker 20. .. In the illustrated example, the user gives an utterance request to the smart speaker 20 "Tell me the check-out time". In the present embodiment, various requests or questions that the user inputs to the smart speaker 20 or the like are referred to as requests.
 スマートスピーカ20に対してユーザが入力した音声のリクエストは、インターネット等のネットワークを介して音声処理サーバ50へ入力される。音声処理サーバ50は、音声入力に対して音声認識の処理を行うことによって、入力された音声の情報をテキスト(文字列)の情報に変換する。音声処理サーバ50は、変換したテキストを管理サーバ30へ入力する。 The voice request input by the user to the smart speaker 20 is input to the voice processing server 50 via a network such as the Internet. The voice processing server 50 converts the input voice information into text (character string) information by performing voice recognition processing on the voice input. The voice processing server 50 inputs the converted text to the management server 30.
 管理サーバ30は、音声処理サーバ50から入力されたテキストの情報に基づいて、ユーザのリクエストに対する応答情報を生成する処理を行う。この処理のために管理サーバ30は、意図識別モデル81及び応答情報DB(データベース)82を用いる。意図識別モデル81は、リクエストに係るテキスト情報を入力として受け付け、このリクエストに係るユーザの意図を識別した識別結果を出力するように予め学習された学習済モデルである。応答情報DB82は、リクエストに係る意図と、意図に対する応答メッセージのテキストとが対応付けて記憶されたデータベースである。なお本例では意図識別モデル81及び応答情報DB82を管理サーバ30が備える構成とするが、これに限るものではなく、意図識別モデル81又は応答情報DB82は他のサーバが備えてもよい。 The management server 30 performs a process of generating response information to a user's request based on the text information input from the voice processing server 50. For this process, the management server 30 uses the intention identification model 81 and the response information DB (database) 82. The intention identification model 81 is a trained model that has been trained in advance so as to accept text information related to a request as input and output an identification result that identifies the user's intention related to this request. The response information DB 82 is a database in which the intention related to the request and the text of the response message to the intention are stored in association with each other. In this example, the management server 30 includes the intention identification model 81 and the response information DB 82, but the present invention is not limited to this, and the intention identification model 81 or the response information DB 82 may be provided by another server.
 管理サーバ30は、音声処理サーバ50から得たテキスト情報を意図識別モデル81へ入力し、意図識別モデル81が出力する情報を取得することによって、ユーザのリクエストの意図を取得する。図1に示す例では「チェックアウト時間を教えて」のテキスト情報に対し、チェックアウトの時間を問い合わせているというユーザの意図が意図識別モデル81により識別される。次いで管理サーバ30は、識別されたユーザの意図に基づいて応答情報DB82を参照することによって、リクエストの意図に対する応答メッセージのテキスト情報を取得する。管理サーバ30は、応答情報DB82から取得したテキスト情報を音声処理サーバ50へ出力する。図1に示す例では、チェックアウトの時間を問い合わせているという意図に対し、応答情報として例えば「午前10時までにチェックアウトしてください」のテキスト情報が応答情報DB82に記憶されている。 The management server 30 inputs the text information obtained from the voice processing server 50 into the intention identification model 81, and acquires the information output by the intention identification model 81 to acquire the intention of the user's request. In the example shown in FIG. 1, the intention identification model 81 identifies the user's intention to inquire about the checkout time with respect to the text information of "tell me the checkout time". Next, the management server 30 acquires the text information of the response message for the intention of the request by referring to the response information DB 82 based on the intention of the identified user. The management server 30 outputs the text information acquired from the response information DB 82 to the voice processing server 50. In the example shown in FIG. 1, in response to the intention of inquiring about the check-out time, for example, the text information "Please check out by 10 am" is stored in the response information DB 82 as the response information.
 管理サーバ30から応答情報のテキスト出力が与えられた音声処理サーバ50は、与えられたテキスト情報を音声情報に変換する。音声処理サーバ50は、変換した音声情報をスマートスピーカ20へ出力する。スマートスピーカ20は、音声処理サーバ50から出力された音声情報を取得し、この音声情報に基づく音声出力をスピーカ27にて行う。図1に示す例では、スマートスピーカ20は「午前10時までにチェックアウトしてください」の音声出力を行っている。 The voice processing server 50 to which the text output of the response information is given from the management server 30 converts the given text information into voice information. The voice processing server 50 outputs the converted voice information to the smart speaker 20. The smart speaker 20 acquires the voice information output from the voice processing server 50, and outputs the voice based on the voice information to the speaker 27. In the example shown in FIG. 1, the smart speaker 20 outputs the voice "Please check out by 10 am".
 なお、ユーザによるスマートスピーカ20へのリクエストに対して、管理サーバ30は音声出力以外の応答を行うことも可能である。例えば管理サーバ30は、スマートスピーカ20のタッチパネル25に画像を表示することができる。この場合に、管理サーバ30は表示用の画像の情報を音声処理サーバ50へ出力し、音声処理サーバ50はこの画像情報をスマートスピーカ20へ与える。 Note that the management server 30 can also respond to a request from the user to the smart speaker 20 other than voice output. For example, the management server 30 can display an image on the touch panel 25 of the smart speaker 20. In this case, the management server 30 outputs image information for display to the voice processing server 50, and the voice processing server 50 gives this image information to the smart speaker 20.
 また上述の音声入出力によるユーザへの情報提供は、例えばユーザが発話した言語が音声処理サーバ50の音声処理の対応言語に含まれていない場合、また例えばユーザの発話内容が音声処理サーバ50により適切にテキスト情報へ変換できない場合等に、適切な情報をユーザに対して提供できない可能性がある。本実施の形態に係る情報提供システムでは、音声入出力による情報提供を適切に行うことができない場合に、テキストの入出力による情報提供への切り替えを行う。テキストの入出力によって自動的な応答を行うシステムはいわゆるチャットボットと呼ばれるものであり、本実施の形態に係る情報提供システムは、スマートスピーカ20を用いた情報提供と、チャットボットを用いた情報提供とを適宜に切り替えて行うシステムである。 Further, the above-mentioned information provision to the user by voice input / output is performed when, for example, the language spoken by the user is not included in the language corresponding to the voice processing of the voice processing server 50, or when the content of the user's speech is, for example, the voice processing server 50. When it is not possible to properly convert to text information, it may not be possible to provide appropriate information to the user. In the information providing system according to the present embodiment, when it is not possible to appropriately provide information by voice input / output, it switches to information provision by text input / output. A system that automatically responds by inputting / outputting text is a so-called chatbot, and the information providing system according to the present embodiment provides information using the smart speaker 20 and information providing using the chatbot. It is a system that switches between and appropriately.
 管理サーバ30は、例えば音声処理サーバ50から音声のテキストへの変換ができなかった旨を通知された場合、また例えば意図識別モデル81にてユーザのリクエストの意図が識別できなかった場合等に、チャットボットへの切り替えを促すメッセージをスマートスピーカ20に出力させる。メッセージは例えば「音声リクエストが認識できませんでした。チャットボットを利用しますか?」等とすることができる。このメッセージは、スマートスピーカ20が音声出力してもよく、タッチパネル25にテキストとして表示してもよい。このメッセージに対してチャットボットの利用を肯定する旨の意志がユーザにより示された場合、管理サーバ30は、スマートフォン40等の端末装置を利用してチャットボットにアクセスするためのアクセス用情報を、スマートスピーカ20に出力させる。チャットボットを利用するか否かのユーザの意志は、例えば音声入力として受け付けられてもよく、また例えばスマートスピーカ20のタッチパネル25に対するタッチ操作により受け付けられてもよい。 The management server 30 is notified, for example, that the voice processing server 50 cannot convert the voice into text, or when the intention identification model 81 cannot identify the intention of the user's request. A message prompting the user to switch to the chatbot is output to the smart speaker 20. The message can be, for example, "The voice request could not be recognized. Do you want to use a chatbot?" This message may be output by the smart speaker 20 as voice, or may be displayed as text on the touch panel 25. When the user indicates the intention to affirm the use of the chatbot in response to this message, the management server 30 provides access information for accessing the chatbot using a terminal device such as a smartphone 40. Output to the smart speaker 20. The user's will to decide whether or not to use the chatbot may be accepted, for example, as a voice input, or may be accepted, for example, by a touch operation on the touch panel 25 of the smart speaker 20.
 図3は、スマートスピーカ20に表示されるアクセス用情報の一例を示す模式図である。本実施の形態に係る情報提供システムにおいては、スマートスピーカ20がタッチパネル25にアクセス用情報20aを表示する。図示のアクセス用情報20aは二次元コード(例えばバーコード又はQRコード(登録商標))であり、例えばチャットボットにアクセスするためのURL(Uniform Resource Locator)等の情報をコード化したものである。また本実施の形態においてアクセス用情報20aには、宿泊施設及び客室等を識別する識別情報が含まれている。これによりチャットボットは、アクセス用情報20aに含まれる識別情報を取得し、宿泊施設及び客室等に適した自動応答を行うことができる。ただし、アクセス用情報20aは、URL等の情報のみを含み、宿泊施設及び客室等の識別情報を含まない構成であってもよい。この場合には、例えばユーザが宿泊施設及び客室等に関する情報を入力する構成としてもよい。 FIG. 3 is a schematic diagram showing an example of access information displayed on the smart speaker 20. In the information providing system according to the present embodiment, the smart speaker 20 displays the access information 20a on the touch panel 25. The illustrated access information 20a is a two-dimensional code (for example, a barcode or a QR code (registered trademark)), and is a code of information such as a URL (Uniform Resource Locator) for accessing a chatbot, for example. Further, in the present embodiment, the access information 20a includes identification information for identifying the accommodation facility, the guest room, and the like. As a result, the chatbot can acquire the identification information included in the access information 20a and perform an automatic response suitable for the accommodation facility, the guest room, and the like. However, the access information 20a may have a configuration that includes only information such as a URL and does not include identification information such as accommodation facilities and guest rooms. In this case, for example, the user may input information about accommodation facilities, guest rooms, and the like.
 ユーザは、スマートスピーカ20に表示されたアクセス用情報20aを自身のスマートフォン40のカメラで撮影することで、チャットボットにアクセスするためのURL等の情報を取得することができる。アクセス用情報20aを取得したユーザは、スマートフォン40にインストールされたブラウザ等のアプリケーションを用いてチャットボットのサービスを提供するサーバへアクセスすることによって、チャットボットの利用を開始することができる。本実施の形態においては、管理サーバ30がチャットボットのサービスを提供する。スマートフォン40の表示部にはチャット画面が表示され、ユーザはスマートフォン40のテキスト入力機能を利用して、チャット画面にリクエストをテキストとして入力することができる。 The user can acquire information such as a URL for accessing the chatbot by photographing the access information 20a displayed on the smart speaker 20 with the camera of his / her smartphone 40. The user who has acquired the access information 20a can start using the chatbot by accessing the server that provides the chatbot service using an application such as a browser installed on the smartphone 40. In the present embodiment, the management server 30 provides the chatbot service. A chat screen is displayed on the display unit of the smartphone 40, and the user can input the request as text on the chat screen by using the text input function of the smartphone 40.
 図4及び図5は、スマートフォン40に表示されるチャット画面の一例を示す模式図である。図4には、チャットボットの利用に先立って、ユーザに対する言語選択を促す選択ボックスの表示例が示されている。本実施の形態においてはチャットの言語として英語、日本語、中国語及び韓国語等が利用可能であり、選択ボックスには選択し得る言語が、その言語の表記で一覧表示される。ユーザは、選択ボックスに表示された言語に対するタッチ(タップ)操作を行うことで、いずれか1つの言語を選択することができる。 4 and 5 are schematic views showing an example of a chat screen displayed on the smartphone 40. FIG. 4 shows a display example of a selection box that prompts the user to select a language prior to using the chatbot. In the present embodiment, English, Japanese, Chinese, Korean, etc. can be used as the chat language, and the selectable languages are listed in the notation of the language in the selection box. The user can select any one language by performing a touch (tap) operation on the language displayed in the selection box.
 なお図4に示す言語の選択ボックスは、チャット開始に先立って表示されるのみでなく、図5に示すチャット画面の下部左側に設けられた言語選択アイコンに対するユーザのタッチ操作が行われた場合にも表示されてよい。言語選択アイコンには、その時点で選択されている言語がいずれであるかを示すラベルが付される。また図5のチャット画面には、言語選択アイコンの隣に、有人応答への切り替えを行うための切替アイコンが表示される。 The language selection box shown in FIG. 4 is not only displayed prior to the start of chat, but also when the user touches the language selection icon provided on the lower left side of the chat screen shown in FIG. May also be displayed. The language selection icon is labeled to indicate which language is currently selected. Further, on the chat screen of FIG. 5, a switching icon for switching to the manned response is displayed next to the language selection icon.
 本実施の形態においては、図4に示す選択ボックスによりユーザが言語を選択する構成とするが、これに限るものではない。例えば宿泊施設の宿泊客に関する情報を記憶したサーバ装置からユーザの国籍等の情報を取得し、このユーザに適した言語を自動的に選択する構成としてもよい。 In the present embodiment, the user selects the language from the selection box shown in FIG. 4, but the present invention is not limited to this. For example, information such as the nationality of the user may be acquired from a server device that stores information about the guests of the accommodation facility, and a language suitable for the user may be automatically selected.
 また図5には、チャットによるリクエスト及び応答の一例が示されている。図示の例では、チャットの言語として英語が選択されている。図示のチャット画面には、例えば上部に「ホテル情報サービス」、「Exampleホテル」及び「お問い合わせ」等のタイトル文字列が表示され、このチャット画面がホテルに対する問い合せを行うためのチャット画面であることが示されている。タイトル文字列の下方には、ユーザが入力したリクエストのテキストと、これに対する応答のテキストとが、時系列に上から下へ並べて表示される。またユーザが入力したテキストはチャット画面の右側に寄せて表示され、チャットボットによる応答のテキストは左側に寄せて表示される。本例ではユーザによる「I want a water.」のリクエストのテキストに対し、応答情報として「How many waters would you like to order?」のテキストが出力されている。 In addition, FIG. 5 shows an example of a request and response by chat. In the illustrated example, English is selected as the chat language. In the illustrated chat screen, for example, title character strings such as "hotel information service", "Example hotel", and "inquiry" are displayed at the top, and this chat screen is a chat screen for making inquiries to the hotel. It is shown. Below the title string, the text of the request entered by the user and the text of the response to it are displayed in chronological order from top to bottom. In addition, the text entered by the user is displayed on the right side of the chat screen, and the text of the response by the chatbot is displayed on the left side. In this example, the text "How many waters would you like to order?" Is output as the response information to the text of the request "I want a water." By the user.
 スマートフォン40にて入力されたテキストの情報は、インターネット等のネットワークを介して翻訳サーバ60へ入力される。翻訳サーバ60は、入力されたテキストの情報を、必要に応じて翻訳する。翻訳サーバ60は、管理サーバ30が対応可能な言語以外の言語によるテキストが入力された場合に、このテキストを管理サーバ30が対応可能な言語に翻訳する。例えば管理サーバ30が日本語及び英語に対応可能であり、中国語のリクエストが入力された場合には、翻訳サーバ60はリクエストを中国語から日本語又は英語へ翻訳し、翻訳したテキストの情報を管理サーバ30へ入力する。なお翻訳サーバ60は、翻訳の必要がないテキストの情報が入力された場合、翻訳を行うことなく、テキストの情報を管理サーバ30へ入力すればよい。 The text information input on the smartphone 40 is input to the translation server 60 via a network such as the Internet. The translation server 60 translates the input text information as needed. When a text in a language other than the language supported by the management server 30 is input, the translation server 60 translates the text into a language supported by the management server 30. For example, when the management server 30 can handle Japanese and English and a Chinese request is input, the translation server 60 translates the request from Chinese to Japanese or English and converts the translated text information. Input to the management server 30. When text information that does not need to be translated is input, the translation server 60 may input the text information to the management server 30 without translating.
 管理サーバ30は、翻訳サーバ60から入力されたテキストの情報に基づいて、ユーザのリクエストに対する応答情報を生成する処理を行う。このときに行う管理サーバ30の処理は、音声入力によるユーザのリクエストに対する応答情報を生成する処理と同様であり、上述の意図識別モデル81及び応答情報DB82を用いて行われる。管理サーバ30は、生成したテキストの応答情報を翻訳サーバ60へ出力する。翻訳サーバ60は、管理サーバ30から与えられた応答情報を、必要に応じて適宜に翻訳し、スマートフォン40へ出力する。スマートフォン40は、翻訳サーバ60からテキストとして与えられた応答情報を、チャット画面にテキストとして表示することによって、ユーザが入力したリクエストに対する自動的な応答を行う。なお管理サーバ30は、音声入力のリクエストに対する応答情報を取得するための意図識別モデル81及び応答情報DB82と、テキスト入力のリクエストに対する応答情報を取得するための意図識別モデル81及び応答情報DB82とを別に備えていてもよい。 The management server 30 performs a process of generating response information to a user's request based on the text information input from the translation server 60. The processing of the management server 30 performed at this time is the same as the processing of generating response information for the user's request by voice input, and is performed using the above-mentioned intention identification model 81 and response information DB 82. The management server 30 outputs the response information of the generated text to the translation server 60. The translation server 60 appropriately translates the response information given by the management server 30 as necessary, and outputs the response information to the smartphone 40. The smartphone 40 automatically responds to the request input by the user by displaying the response information given as text from the translation server 60 as text on the chat screen. The management server 30 includes an intention identification model 81 and a response information DB 82 for acquiring response information for a voice input request, and an intention identification model 81 and a response information DB 82 for acquiring response information for a text input request. It may be prepared separately.
 チャットボットによるユーザへの情報提供は、例えばユーザがテキストで入力したリクエストが意図識別モデル81にて意図を識別できない場合、また例えば応答情報DB82に対応する応答情報が記憶されていない場合等に、適切な情報をユーザに対して提供できない可能性がある。本実施の形態に係る情報提供システムでは、チャットボットのテキスト入出力による情報提供を適切に行うことができない場合に、オペレータによる有人応答への切り替えを行う。本実施の形態に係る情報提供システムでは、ユーザからのリクエストに自動応答で対応できない場合に、管理サーバ30に代わってユーザからのリクエストに応答するオペレータが存在する。オペレータは、オペレータ端末70を利用して、ユーザのスマートフォン40との間でテキストによる情報交換、即ちチャットを行うことができる。ユーザは、チャットボットの自動応答と同じチャット画面を用いてオペレータとのチャットを行うことができる。 Information is provided to the user by the chatbot, for example, when the request input by the user in text cannot identify the intention by the intention identification model 81, or when the response information corresponding to the response information DB 82 is not stored, for example. It may not be possible to provide appropriate information to the user. In the information providing system according to the present embodiment, when the information cannot be appropriately provided by the text input / output of the chatbot, the operator switches to the manned response. In the information providing system according to the present embodiment, there is an operator who responds to the request from the user instead of the management server 30 when the request from the user cannot be responded to by the automatic response. The operator can use the operator terminal 70 to exchange textual information with the user's smartphone 40, that is, to chat. The user can chat with the operator using the same chat screen as the chatbot's automatic response.
 管理サーバ30は、テキスト入出力による自動応答ができないと判定した場合に、有人応答への切り替えを促すメッセージをスマートフォン40に出力させる。メッセージは例えば「リクエストにお答えすることができません。有人応答に切り替える場合には下の切替アイコンをタッチしてください。」等とすることができる。このメッセージに対してユーザによる切替アイコンのタッチ操作が行われ、有人応答への切り替えを肯定する旨の意志がユーザにより示された場合、管理サーバ30は、オペレータ端末70に有人応答を依頼する通知を送信し、チャットボットの自動応答からオペレータの有人応答への切り替えを行う。 When the management server 30 determines that the automatic response by text input / output cannot be performed, the management server 30 causes the smartphone 40 to output a message prompting the user to switch to the manned response. The message can be, for example, "We cannot answer your request. If you want to switch to a manned response, please touch the switch icon below." When the user performs a touch operation of the switching icon in response to this message and the user indicates the intention to affirm the switching to the manned response, the management server 30 notifies the operator terminal 70 to request the manned response. To switch from the chatbot's automatic response to the operator's manned response.
 なおチャット画面に設けられた切替アイコンに対するタッチ操作による有人応答への切り替えは、管理サーバ30が切り替えを促すメッセージをスマートフォン40に出力させた場合にのみ受け付けてもよく、このメッセージの出力の有無に関係なく常時的に受け付けてもよい。管理サーバ30が切り替えを促すメッセージをスマートフォン40に出力させた場合にのみ切り替えを受け付ける場合、不要の際には切替アイコンをチャット画面に表示しなくてよい。 It should be noted that switching to a manned response by touching the switching icon provided on the chat screen may be accepted only when the management server 30 outputs a message prompting the switching to the smartphone 40, and whether or not this message is output may be accepted. Regardless, it may be accepted at all times. When the management server 30 accepts the switch only when the message prompting the switch is output to the smartphone 40, the switch icon does not have to be displayed on the chat screen when it is not needed.
 オペレータによる有人応答への切り替え後、管理サーバ30は、スマートフォン40にてユーザにより入力され、必要に応じて翻訳サーバ60にて翻訳されたテキストの情報を、オペレータ端末70へ送信する。オペレータ端末70は、管理サーバ30からのテキストの情報を基に、図5と同様のチャット画面を表示部に表示する。オペレータは、オペレータ端末70に表示されたユーザのリクエストに対する応答の情報を、例えばキーボード等の入力装置を利用してオペレータ端末70へ入力する。なお本実施の形態においては、オペレータ端末70に対してオペレータがテキスト入力を行うものとするが、これに限るものではなく、音声入力等を行う構成であってもよい。オペレータ端末70は、オペレータによる応答のテキスト入力を受け付け、受け付けたテキスト情報を管理サーバ30へ送信する。管理サーバ30はオペレータ端末70からのテキスト情報を翻訳サーバ60へ与え、必要に応じて翻訳サーバ60にて翻訳されたテキスト情報がユーザのスマートフォン40へ与えられる。スマートフォン40は、与えられたテキスト情報をチャット画面に表示する。これらによりユーザ及びオペレータによるテキストを介したコミュニケーションが成立し、ユーザのリクエストに対してオペレータが応答することができる。 After switching to the manned response by the operator, the management server 30 transmits the text information input by the user on the smartphone 40 and translated by the translation server 60 to the operator terminal 70 as needed. The operator terminal 70 displays a chat screen similar to that in FIG. 5 on the display unit based on the text information from the management server 30. The operator inputs the information of the response to the user's request displayed on the operator terminal 70 to the operator terminal 70 by using an input device such as a keyboard. In the present embodiment, the operator inputs text to the operator terminal 70, but the present invention is not limited to this, and a configuration such as voice input may be performed. The operator terminal 70 accepts the text input of the response by the operator and transmits the received text information to the management server 30. The management server 30 gives the text information from the operator terminal 70 to the translation server 60, and the text information translated by the translation server 60 is given to the user's smartphone 40 as needed. The smartphone 40 displays the given text information on the chat screen. As a result, communication by the user and the operator via text is established, and the operator can respond to the user's request.
 なお本実施の形態においては、スマートスピーカ20を利用した音声入出力による情報提供からスマートフォン40を利用したテキスト入出力による情報提供への切り替えを、スマートスピーカ20に表示したアクセス用情報を介して行う構成としたが、これに限るものではない。例えば音声入出力及びテキスト入出力の両方を利用可能な装置が客室等に設置されている場合、管理サーバ30が、当該装置に対して音声入出力からテキスト入出力への切替指示を出力し、当該出力に基づいて、当該装置が音声入出力からテキスト入出力へ切り替わることとしてもよい。 In the present embodiment, switching from information provision by voice input / output using the smart speaker 20 to information provision by text input / output using the smartphone 40 is performed via the access information displayed on the smart speaker 20. Although it is configured, it is not limited to this. For example, when a device capable of using both voice input / output and text input / output is installed in a guest room or the like, the management server 30 outputs a switching instruction from voice input / output to text input / output to the device. Based on the output, the device may switch from voice input / output to text input / output.
<装置構成>
 図6は、本実施の形態に係るスマートスピーカ20の構成を示すブロック図である。本実施の形態に係るスマートスピーカ20は、処理部(プロセッサ)21、記憶部(ストレージ)22、通信部(トランシーバ)23、タッチパネル25、マイク26、スピーカ27及びカメラ28等を備えて構成されている。処理部21は、CPU(Central Processing Unit)又はMPU(Micro-Processing Unit)等の演算処理装置を用いて構成されている。また処理部21は、複数のCPU又はマルチコアCPU等が採用されてもよい。処理部21は、記憶部22に記憶されたプログラム22aを読み出して実行することにより、音声入力の受け付け、音声出力及び画像表示等の種々の処理を行う。
<Device configuration>
FIG. 6 is a block diagram showing the configuration of the smart speaker 20 according to the present embodiment. The smart speaker 20 according to the present embodiment includes a processing unit (processor) 21, a storage unit (storage) 22, a communication unit (transceiver) 23, a touch panel 25, a microphone 26, a speaker 27, a camera 28, and the like. There is. The processing unit 21 is configured by using an arithmetic processing unit such as a CPU (Central Processing Unit) or an MPU (Micro-Processing Unit). Further, the processing unit 21 may employ a plurality of CPUs, a multi-core CPU, or the like. The processing unit 21 reads and executes the program 22a stored in the storage unit 22 to perform various processes such as reception of voice input, voice output, and image display.
 記憶部22は、例えばフラッシュメモリなどの不揮発性のメモリ素子、又は、ハードディスクなどの磁気記憶装置等を用いて構成されている。記憶部22は、処理部21が実行する各種のプログラム、及び、処理部21の処理に必要な各種のデータを記憶する。本実施の形態において記憶部22は、処理部21が実行するプログラム22aと、スマートスピーカ20を識別するための識別情報22bとを記憶している。 The storage unit 22 is configured by using, for example, a non-volatile memory element such as a flash memory, a magnetic storage device such as a hard disk, or the like. The storage unit 22 stores various programs executed by the processing unit 21 and various data required for processing by the processing unit 21. In the present embodiment, the storage unit 22 stores the program 22a executed by the processing unit 21 and the identification information 22b for identifying the smart speaker 20.
 プログラム22aは、例えばスマートスピーカ20の製造段階において記憶部22に書き込まれてもよい。例えばプログラム22aは、遠隔のサーバ装置等が配信するものをスマートスピーカ20が通信にて取得してもよい。例えばプログラム22aは、メモリカード又は光ディスク等の記録媒体99に記録されたプログラム22aをスマートスピーカ20が読み出して記憶部22に記憶してもよい。例えばプログラム22aは、記録媒体99に記録されたものを書込装置が読み出してスマートスピーカ20の記憶部22に書き込んでもよい。プログラム22aは、ネットワークを介した配信の態様で提供されてもよく、記録媒体99に記録された態様で提供されてもよい。 The program 22a may be written in the storage unit 22 at the manufacturing stage of the smart speaker 20, for example. For example, in the program 22a, the smart speaker 20 may acquire what is distributed by a remote server device or the like by communication. For example, in the program 22a, the smart speaker 20 may read the program 22a recorded on the recording medium 99 such as a memory card or an optical disk and store it in the storage unit 22. For example, in the program 22a, the writing device may read what has been recorded on the recording medium 99 and write it in the storage unit 22 of the smart speaker 20. The program 22a may be provided in the form of distribution via the network, or may be provided in the form recorded on the recording medium 99.
 識別情報22bは、文字又は数値等が適宜に組み合わされた情報であり、スマートスピーカ20を識別し得る情報であればどのようなものであってもよい。本実施の形態において識別情報22bは、スマートスピーカ20が設置されている宿泊施設を識別する情報と、客室を識別する情報とが組み合わされた情報である。例えば識別情報22bは、Exampleホテルの301号室を示す情報とすることができる。識別情報22bは、スマートスピーカ20が宿泊施設の客室に設置される際に、システムの管理者又は宿泊施設の管理者等により設定される。 The identification information 22b is information in which characters, numerical values, and the like are appropriately combined, and may be any information as long as it can identify the smart speaker 20. In the present embodiment, the identification information 22b is information in which information for identifying the accommodation facility in which the smart speaker 20 is installed and information for identifying the guest room are combined. For example, the identification information 22b can be information indicating room 301 of the Example hotel. The identification information 22b is set by the system administrator, the accommodation facility administrator, or the like when the smart speaker 20 is installed in the guest room of the accommodation facility.
 通信部23は、インターネット、無線LAN(Local Area Network)及び携帯電話通信網等を含むネットワークNを介して、種々の装置との間で通信を行うことができる。本実施の形態においては、通信部23は、ネットワークNを介して音声処理サーバ50との通信を行う。通信部23は、処理部21から与えられたデータを音声処理サーバ50へ送信すると共に、音声処理サーバ50から受信したデータを処理部21へ与える。 The communication unit 23 can communicate with various devices via a network N including the Internet, a wireless LAN (Local Area Network), a mobile phone communication network, and the like. In the present embodiment, the communication unit 23 communicates with the voice processing server 50 via the network N. The communication unit 23 transmits the data given by the processing unit 21 to the voice processing server 50, and gives the data received from the voice processing server 50 to the processing unit 21.
 タッチパネル25は、スマートスピーカ20が備えるユーザインタフェースの1つであり、表示部25a及び入力部25bを有している。表示部25aは、液晶ディスプレイ等を用いて構成されており、種々の画像及び文字等を表示する。表示部25aは、処理部21の処理に基づいて種々の画像表示を行う。入力部25bは、ユーザによる接触を検知するセンサが表示部25aの表面に設けられたものであり、表示部25aに表示された画像等に対するタッチ(タップ)操作等を入力として受け付け、受け付けた操作を処理部21へ通知する。 The touch panel 25 is one of the user interfaces included in the smart speaker 20, and has a display unit 25a and an input unit 25b. The display unit 25a is configured by using a liquid crystal display or the like, and displays various images, characters, and the like. The display unit 25a displays various images based on the processing of the processing unit 21. The input unit 25b is provided with a sensor for detecting contact by the user on the surface of the display unit 25a, and receives and accepts a touch (tap) operation or the like for an image or the like displayed on the display unit 25a as an input. Is notified to the processing unit 21.
 マイク26及びスピーカ27は、スマートスピーカ20が備えるユーザインタフェースの1つであり、音声入出力によるユーザインタフェースを実現する。マイク26は、周辺の音声を取得してデジタルの音声情報に変換し、音声情報を処理部21へ与える。スピーカ27は、処理部21から与えられる音声情報に基づいて、音声出力を行う。また、カメラ28は、例えばユーザ又は客室等の撮像を行い、撮像した画像情報を処理部21へ与える。 The microphone 26 and the speaker 27 are one of the user interfaces included in the smart speaker 20, and realize a user interface by voice input / output. The microphone 26 acquires peripheral voice, converts it into digital voice information, and gives the voice information to the processing unit 21. The speaker 27 outputs audio based on the audio information given by the processing unit 21. Further, the camera 28 takes an image of, for example, a user or a guest room, and gives the captured image information to the processing unit 21.
 本実施の形態に係るスマートスピーカ20は、ユーザの発話によるリクエストをスピーカ27にて取得し、取得したリクエストに係る音声情報を通信部23にて音声処理サーバ50へ送信する。このときにスマートスピーカ20は、記憶部22に記憶した識別情報22bを付して音声情報を送信する。この音声情報は音声処理サーバ50にてテキスト情報に変換されて管理サーバ30へ与えられ、管理サーバ30はリクエストに対する応答をテキスト情報として音声処理サーバ50へ送信する。音声処理サーバ50は、管理サーバ30からのテキスト情報を音声情報に変換してスマートスピーカ20へ送信する。スマートスピーカ20は、音声処理サーバ50から与えられた音声情報をスピーカ27から出力する。 The smart speaker 20 according to the present embodiment acquires a request uttered by the user at the speaker 27, and transmits the voice information related to the acquired request to the voice processing server 50 at the communication unit 23. At this time, the smart speaker 20 attaches the identification information 22b stored in the storage unit 22 and transmits the voice information. This voice information is converted into text information by the voice processing server 50 and given to the management server 30, and the management server 30 transmits a response to the request as text information to the voice processing server 50. The voice processing server 50 converts the text information from the management server 30 into voice information and transmits it to the smart speaker 20. The smart speaker 20 outputs the voice information given by the voice processing server 50 from the speaker 27.
 また管理サーバ30は、音声処理サーバ50から与えられたテキスト情報によるユーザのリクエストに応答できない場合、その旨を通知すると共に、チャットボットへアクセスするためのアクセス用情報を送信する。この通知及びアクセス用情報は、音声処理サーバ50を介してスマートスピーカ20へ与えられる。この通知に応じてスマートスピーカ20は、チャットボットへの切り替えを行うか否かをユーザに問い合わせ、チャットボットへの切り替えが肯定された場合に、図3に示すように、二次元コードのアクセス用情報をタッチパネル25の表示部25aに表示する。 If the management server 30 cannot respond to the user's request based on the text information given by the voice processing server 50, the management server 30 notifies that fact and transmits access information for accessing the chatbot. This notification and access information is given to the smart speaker 20 via the voice processing server 50. In response to this notification, the smart speaker 20 asks the user whether or not to switch to the chatbot, and when the switch to the chatbot is affirmed, as shown in FIG. 3, for accessing the two-dimensional code. The information is displayed on the display unit 25a of the touch panel 25.
 図7は、本実施の形態に係る管理サーバ30の構成を示すブロック図である。本実施の形態に係る管理サーバ30は、処理部(プロセッサ)31、記憶部(ストレージ)32及び通信部(トランシーバ)33等を備えて構成されている。処理部31は、CPU、MPU又はGPU(Graphics Processing Unit)等の演算処理装置を用いて構成されている。処理部31は、記憶部32に記憶されたサーバプログラム32aを読み出して実行することにより、ユーザのリクエストに応答する処理、音声入出力からチャットボットへの切替処理、及び、チャットボットから有人応答への切替処理等の種々の処理を行う。 FIG. 7 is a block diagram showing the configuration of the management server 30 according to the present embodiment. The management server 30 according to the present embodiment includes a processing unit (processor) 31, a storage unit (storage) 32, a communication unit (transceiver) 33, and the like. The processing unit 31 is configured by using an arithmetic processing unit such as a CPU, MPU, or GPU (Graphics Processing Unit). The processing unit 31 reads and executes the server program 32a stored in the storage unit 32 to respond to a user's request, switch from voice input / output to a chatbot, and change from a chatbot to a manned response. Performs various processes such as switching process.
 記憶部32は、例えばハードディスク等の大容量の記憶装置を用いて構成されている。記憶部32は、処理部31が実行する各種のプログラム、及び、処理部31の処理に必要な各種のデータを記憶する。本実施の形態において記憶部32は、処理部31が実行するサーバプログラム32aと、学習済モデル(識別器)として意図識別モデル81とを記憶している。また記憶部32には、ユーザのリクエストに対する応答が記憶された応答情報DB82と、有人応答を行うオペレータに関する情報が記憶されたオペレータDB83との2つのデータベースが設けられている。 The storage unit 32 is configured by using a large-capacity storage device such as a hard disk. The storage unit 32 stores various programs executed by the processing unit 31 and various data required for processing by the processing unit 31. In the present embodiment, the storage unit 32 stores the server program 32a executed by the processing unit 31 and the intention identification model 81 as a learned model (discriminative device). Further, the storage unit 32 is provided with two databases, a response information DB 82 in which a response to a user's request is stored, and an operator DB 83 in which information about an operator who performs a manned response is stored.
 通信部33は、インターネット、無線LAN及び携帯電話通信網等を含むネットワークNを介して、種々の装置との間で通信を行うことができる。本実施の形態においては、通信部33は、ネットワークNを介して音声処理サーバ50、翻訳サーバ60及びオペレータ端末70等との通信を行う。通信部33は、処理部31から与えられたデータを他の装置へ送信すると共に、他の装置から受信したデータを処理部31へ与える。 The communication unit 33 can communicate with various devices via the network N including the Internet, wireless LAN, mobile phone communication network, and the like. In the present embodiment, the communication unit 33 communicates with the voice processing server 50, the translation server 60, the operator terminal 70, and the like via the network N. The communication unit 33 transmits the data given by the processing unit 31 to another device, and gives the data received from the other device to the processing unit 31.
 なお記憶部32は、管理サーバ30に接続された外部記憶装置であってよい。また管理サーバ30は、複数のコンピュータを含んで構成されるマルチコンピュータであってよく、ソフトウェアによって仮想的に構築された仮想マシンであってもよい。また管理サーバ30は、上記の構成に限定されず、例えば可搬型の記憶媒体に記憶された情報を読み取る読取部、操作入力を受け付ける入力部、又は、画像を表示する表示部等を含んでもよい。 The storage unit 32 may be an external storage device connected to the management server 30. Further, the management server 30 may be a multi-computer including a plurality of computers, or may be a virtual machine virtually constructed by software. Further, the management server 30 is not limited to the above configuration, and may include, for example, a reading unit that reads information stored in a portable storage medium, an input unit that accepts operation input, a display unit that displays an image, and the like. ..
 サーバプログラム32aは、例えば管理サーバ30の製造段階において記憶部32に書き込まれてもよい。例えばサーバプログラム32aは、遠隔の他のサーバ装置等が配信するものを管理サーバ30が通信にて取得してもよい。例えばサーバプログラム32aは、メモリカード又は光ディスク等の記録媒体98に記録されたサーバプログラム32aを管理サーバ30が読み出して記憶部32に記憶してもよい。例えばサーバプログラム32aは、記録媒体98に記録されたものを書込装置が読み出して管理サーバ30の記憶部32に書き込んでもよい。サーバプログラム32aは、ネットワークを介した配信の態様で提供されてもよく、記録媒体98に記録された態様で提供されてもよい。 The server program 32a may be written to the storage unit 32, for example, at the manufacturing stage of the management server 30. For example, in the server program 32a, the management server 30 may acquire what is distributed by another remote server device or the like by communication. For example, in the server program 32a, the management server 30 may read the server program 32a recorded on the recording medium 98 such as a memory card or an optical disk and store it in the storage unit 32. For example, in the server program 32a, the writing device may read what was recorded on the recording medium 98 and write it in the storage unit 32 of the management server 30. The server program 32a may be provided in a mode of distribution via a network, or may be provided in a mode recorded on a recording medium 98.
 意図識別モデル81は、予め教師データを用いた機械学習又は深層学習等がなされた学習済モデルである。学習済モデルは、入力値に対して所定の演算を行い、演算結果を出力するものであり、記憶部32にはこの演算を規定する関数の係数及び閾値等のデータが意図識別モデル81として記憶される。意図識別モデル81は、ユーザが音声又はテキストとして入力した文章の情報に対し、このユーザの意図を識別するように学習された学習済モデルである。サーバプログラム32aを実行する処理部31が、意図識別モデル81として記憶されたデータを読み込むことによって、ユーザのリクエストの意図を識別するための演算を処理部31が実行することが可能となる。 The intention identification model 81 is a trained model in which machine learning or deep learning using teacher data has been performed in advance. The trained model performs a predetermined operation on the input value and outputs the operation result, and the storage unit 32 stores data such as the coefficient and the threshold of the function that defines this operation as the intention identification model 81. Will be done. The intention identification model 81 is a learned model learned to identify the user's intention with respect to the text information input by the user as voice or text. The processing unit 31 that executes the server program 32a reads the data stored as the intention identification model 81, so that the processing unit 31 can execute an operation for identifying the intention of the user's request.
 図8は、意図識別モデル81の一構成例を示す模式図である。本実施の形態において意図識別モデル81は、ユーザのリクエストに係るテキスト情報の入力を受け付ける入力層と、入力された情報に対して所定の演算を行う中間層と、ユーザのリクエストの意図の識別結果を示す情報を出力する出力層とを有するニューラルネットワークとして構成される。本実施の形態において意図識別モデル81は、ニューラルネットワークとして例えばRNN(Recurrent Neural Network)の構成が採用され得るが、これに限るものではなく、RNN以外のモデルが採用されてもよい。管理サーバ30は、RNNのモデルに対して、リクエストに係るテキスト情報及び意図を対応付けた教師データを用いて学習を行うことにより、意図識別モデル81を生成する。 FIG. 8 is a schematic diagram showing a configuration example of the intention identification model 81. In the present embodiment, the intention identification model 81 includes an input layer that accepts input of text information related to a user's request, an intermediate layer that performs a predetermined operation on the input information, and an identification result of the intention of the user's request. It is configured as a neural network having an output layer that outputs information indicating the above. In the present embodiment, the intention identification model 81 may adopt, for example, a configuration of RNN (Recurrent Neural Network) as the neural network, but the present invention is not limited to this, and a model other than RNN may be adopted. The management server 30 generates the intention identification model 81 by learning the RNN model using the text information related to the request and the teacher data associated with the intention.
 なお、本実施の形態においては、意図識別モデル81を管理サーバ30が備える構成とするが、これに限るものではない。例えば意図識別モデル81を別のサーバ装置が備え、管理サーバ30が取得したユーザのリクエストに係るテキスト情報を別のサーバ装置へ送信し、別のサーバ装置が有する意図識別モデル81による識別結果を管理サーバ30が取得してもよい。 In the present embodiment, the management server 30 is provided with the intention identification model 81, but the present invention is not limited to this. For example, the intention identification model 81 is provided in another server device, the text information related to the user's request acquired by the management server 30 is transmitted to the other server device, and the identification result by the intention identification model 81 possessed by the other server device is managed. The server 30 may acquire it.
 また本実施の形態においては、意図識別モデル81の学習処理を管理サーバ30が行う。ただし学習処理は、別のサーバ装置により行われてもよい。この場合に学習された意図識別モデル81は、別のサーバ装置から管理サーバ30へ送信されて記憶部32に記憶されてもよく、別のサーバ装置が保持していてもよい。学習済の意図識別モデル81は、サーバプログラム32aと同様に、ネットワークを介した配信の態様で提供されてもよく、記録媒体98に記録された態様で提供されてもよい。 Further, in the present embodiment, the management server 30 performs the learning process of the intention identification model 81. However, the learning process may be performed by another server device. The intention identification model 81 learned in this case may be transmitted from another server device to the management server 30 and stored in the storage unit 32, or may be held by another server device. The trained intention identification model 81 may be provided in the form of distribution via the network, or may be provided in the form recorded on the recording medium 98, similarly to the server program 32a.
 図9は、応答情報DB82の一構成例を示す模式図である。応答情報DB82は、ユーザのリクエストの意図と、このリクエストに対する応答のメッセージ等の応答情報とが対応付けて記憶されたデータベースである。応答情報DB82のリクエストの意図の各項目は、意図識別モデル81の識別結果に対応するものである。また応答情報DB82の応答情報は、リクエストの意図に対してスマートスピーカ20の音声出力又はチャットボットによるテキスト出力として応答するメッセージのテキスト情報である。なお、リクエストに対する応答は宿泊施設により異なるため、応答情報DB82は宿泊施設毎に設けられる。また図9に示す応答情報DB82は日本語のものであるが、必要に応じて英語又は中国語等の他の言語に対応する応答情報DB82が設けられてよい。 FIG. 9 is a schematic diagram showing a configuration example of the response information DB 82. The response information DB 82 is a database in which the intention of the user's request and the response information such as a response message for this request are stored in association with each other. Each item of the request intention of the response information DB 82 corresponds to the identification result of the intention identification model 81. The response information of the response information DB 82 is text information of a message that responds to the intention of the request as a voice output of the smart speaker 20 or a text output by a chatbot. Since the response to the request differs depending on the accommodation facility, the response information DB 82 is provided for each accommodation facility. The response information DB 82 shown in FIG. 9 is in Japanese, but a response information DB 82 corresponding to another language such as English or Chinese may be provided as needed.
 図9に示す例は、宿泊施設として「Exampleホテル」の応答情報DB82のリクエストの意図及び応答情報の一例である。なおスマートスピーカ20から音声処理サーバ50を介して管理サーバ30へ与えられるリクエスト、及び、スマートフォン40から翻訳サーバ60を介して管理サーバ30へ与えられるリクエストには、宿泊施設及び客室等を識別するための識別情報が例えばヘッダ情報等として付されている。管理サーバ30は、複数の宿泊施設について応答情報DB82を有しており、リクエストに付された識別情報に応じて、宿泊施設に適した応答情報DB82を用いることができる。 The example shown in FIG. 9 is an example of the request intention and response information of the response information DB 82 of the “Example hotel” as an accommodation facility. The request given from the smart speaker 20 to the management server 30 via the voice processing server 50 and the request given from the smartphone 40 to the management server 30 via the translation server 60 are used to identify accommodation facilities, guest rooms, and the like. Identification information is attached as, for example, header information. The management server 30 has response information DB 82 for a plurality of accommodation facilities, and can use response information DB 82 suitable for the accommodation facility according to the identification information attached to the request.
 図示の例では、「チェックインは何時ですか?」というリクエストの意図に、「14:00です。有料で12:00からのアーリーチェックインも可能です。」という応答情報が対応付けて記憶されている。また「チェックインの際に現金が必要ですか?」というリクエストの意図に、「クレジットカード払いも可能です。」という応答情報が対応付けて記憶されている。また「フロントはどこですか?」というリクエストの意図に、「1階です。」という応答情報が対応付けて記憶されている。なお図示のリクエストの意図及び応答情報は、一例であって、これに限るものではない。 In the illustrated example, the response information "It is 14:00. Early check-in from 12:00 is also possible for a fee." Is stored in association with the intention of the request "What time is check-in?" ing. In addition, the response information "Credit card payment is also possible" is stored in association with the intention of the request "Do you need cash at check-in?" In addition, the response information "It is the first floor" is stored in association with the intention of the request "Where is the front desk?". The intention and response information of the requested request shown in the figure is an example, and is not limited to this.
 図10は、オペレータDB83の一構成例を示す模式図である。オペレータDB83は、日時に関する条件と、有人応答への切り替え先となるオペレータに関する情報とが対応付けて記憶されたデータベースである。オペレータDB83は宿泊施設毎に設けられる。図示の例では、平日の午前8時から午後8時までと、休日の午前6時から午後10時までとの日時に対して、オペレータセンターのオペレータが有人応答を行うことが設定されている。また、平日の午前0時から午前8時まで及び午後8時から午後12時までと、休日の午前0時から午前6時まで及び午後10時から午後12時までとの日時に対して、ホテルフロントのオペレータが有人応答を行うことが設定されている。管理サーバ30は、有人応答へ切り替える際に、日時情報に基づいてオペレータDB83を参照し、有人応答を行うオペレータを決定する。管理サーバ30は、決定したオペレータが利用するオペレータ端末70に有人応答を行う要求を送信する。なお図示の例では、オペレータとしてオペレータセンター及びホテルフロント等が設定されているが、実際にはこれらに設けられたオペレータ端末70の識別情報等が設定されてよい。 FIG. 10 is a schematic diagram showing a configuration example of the operator DB 83. The operator DB 83 is a database in which conditions relating to the date and time and information relating to the operator to be switched to the manned response are stored in association with each other. The operator DB 83 is provided for each accommodation facility. In the illustrated example, it is set that the operator of the operator center makes a manned response to the date and time from 8:00 am to 8:00 pm on weekdays and from 6:00 am to 10:00 pm on holidays. Also, for dates and times on weekdays from midnight to 8 am and 8 pm to 12 pm and on holidays from midnight to 6 am and from 10 pm to 12 pm The front operator is set to provide a manned response. When switching to the manned response, the management server 30 refers to the operator DB 83 based on the date and time information and determines the operator to perform the manned response. The management server 30 transmits a request for a manned response to the operator terminal 70 used by the determined operator. In the illustrated example, the operator center, the hotel front, and the like are set as operators, but in reality, the identification information and the like of the operator terminal 70 provided in these may be set.
 またオペレータDB83には、オペレータが対応可能な言語に関する情報が記憶されてもよい。これにより管理サーバ30は、ユーザが使用する言語での対応が可能なオペレータに有人応答を行わせることが可能となる。管理サーバ30は、ユーザが使用する言語について、例えば有人応答への切り替えの際にユーザに言語を選択させてもよく、また例えば宿泊施設の宿泊者名簿等の情報からユーザの言語を判断してもよい。 Further, the operator DB 83 may store information about the language that the operator can handle. As a result, the management server 30 can make the operator who can handle the language used by the user perform a manned response. The management server 30 may allow the user to select the language used by the user, for example, when switching to a manned response, or determines the user's language from information such as a guest list of accommodation facilities. May be good.
 また本実施の形態に係る管理サーバ30は、記憶部32に記憶されたサーバプログラム32aを処理部31が読み出して実行することにより、リクエスト取得部31a、応答処理部31b、切替処理部31c及び学習処理部31d等がソフトウェア的な機能ブロックとして実現される。リクエスト取得部31aは、通信部33にて音声処理サーバ50との通信を行うことによって、ユーザがスマートスピーカ20に対して音声入力し、音声処理サーバ50がテキスト情報に変換したリクエストを取得する処理を行う。またリクエスト取得部31aは、通信部33にて翻訳サーバ60との通信を行うことによって、ユーザがスマートフォン40に対して入力し、翻訳サーバ60にて適宜の言語に翻訳されたリクエストを取得する処理を行う。リクエスト取得部31aが取得するリクエストの情報は、いずれもテキスト情報である。 Further, in the management server 30 according to the present embodiment, the processing unit 31 reads and executes the server program 32a stored in the storage unit 32, so that the request acquisition unit 31a, the response processing unit 31b, the switching processing unit 31c, and the learning unit 31c are executed. The processing unit 31d and the like are realized as software-like functional blocks. The request acquisition unit 31a is a process in which the communication unit 33 communicates with the voice processing server 50 so that the user inputs voice to the smart speaker 20 and the voice processing server 50 acquires a request converted into text information. I do. Further, the request acquisition unit 31a is a process in which the communication unit 33 communicates with the translation server 60 so that the user inputs to the smartphone 40 and the translation server 60 acquires a request translated into an appropriate language. I do. The request information acquired by the request acquisition unit 31a is all text information.
 応答処理部31bは、記憶部32に記憶された意図識別モデル81及び応答情報DB82を用いて、リクエスト取得部31aが取得したリクエストに対する応答を行う。応答処理部31bは、リクエスト取得部31aが取得したリクエストのテキスト情報を意図識別モデル81へ入力し、意図識別モデル81が出力する意図の識別結果を取得する。応答処理部31bは、取得したリクエストの意図に基づいて、応答情報DB82を参照し、意図に対応する応答情報を取得する。応答処理部31bは、取得した応答情報を音声処理サーバ50又は翻訳サーバ60へ送信し、スマートスピーカ20の音声出力による応答又はスマートフォン40のテキスト出力による応答を行う。 The response processing unit 31b responds to the request acquired by the request acquisition unit 31a by using the intention identification model 81 and the response information DB 82 stored in the storage unit 32. The response processing unit 31b inputs the text information of the request acquired by the request acquisition unit 31a into the intention identification model 81, and acquires the intention identification result output by the intention identification model 81. The response processing unit 31b refers to the response information DB 82 based on the intention of the acquired request, and acquires the response information corresponding to the intention. The response processing unit 31b transmits the acquired response information to the voice processing server 50 or the translation server 60, and makes a response by the voice output of the smart speaker 20 or a response by the text output of the smartphone 40.
 切替処理部31cは、リクエスト取得部31aが取得したユーザのリクエストに対して、適切な応答を行うことができない場合に、スマートスピーカ20を利用する音声応答からスマートフォン40を利用するチャットボットへの切り替え、又は、チャットボットから有人応答への切り替えを行う。切替処理部31cは、例えばリクエスト取得部31aが取得したリクエストのテキスト情報に対して意図識別モデル81が意図を識別できなかった場合、又は、識別された意図に対応する応答情報が応答情報DB82から得られなかった場合等に、切り替えを行う。 The switching processing unit 31c switches from a voice response using the smart speaker 20 to a chatbot using the smartphone 40 when an appropriate response cannot be made to the user's request acquired by the request acquisition unit 31a. Or, switch from a chatbot to a manned response. In the switching processing unit 31c, for example, when the intention identification model 81 cannot identify the intention with respect to the text information of the request acquired by the request acquisition unit 31a, or the response information corresponding to the identified intention is transmitted from the response information DB 82. If it cannot be obtained, switch it.
 切替処理部31cは、音声処理サーバ50から与えられたリクエストに対して応答を行うことができない場合、このリクエストの送信元となるスマートスピーカ20に対して、チャットボットへの切り替えを促す指示を送信する。このときに管理サーバ30から音声処理サーバ50を介してスマートスピーカ20へ与えられる指示には、チャットボットへのアクセスに必要なアクセス用情報20aの画像が含まれる。アクセス用情報20aは、例えばチャットボットにアクセスするためのURLと、スマートスピーカ20が設置された宿泊施設及び客室の識別情報とを二次元コード化した情報である。この指示を受信したスマートスピーカ20は、チャットボットへの切り替えの可否をユーザに問い合わせ、切り替えを行う旨の回答が得られた場合にアクセス用情報20aを表示する。 When the switching processing unit 31c cannot respond to the request given by the voice processing server 50, the switching processing unit 31c transmits an instruction prompting the smart speaker 20 that is the source of this request to switch to the chatbot. To do. At this time, the instruction given from the management server 30 to the smart speaker 20 via the voice processing server 50 includes an image of the access information 20a necessary for accessing the chatbot. The access information 20a is, for example, two-dimensionally coded information of a URL for accessing a chatbot and identification information of an accommodation facility and a guest room in which the smart speaker 20 is installed. Upon receiving this instruction, the smart speaker 20 asks the user whether or not to switch to the chatbot, and displays the access information 20a when a reply to the effect of switching is obtained.
 切替処理部31cは、翻訳サーバ60から与えられたリクエストに対して応答を行うことができない場合、このリクエストの送信元となるスマートフォン40に対して、有人応答への切り替えを促す指示を送信する。この指示を受信したスマートフォン40は、チャットボットから有人応答への切り替えの可否をユーザに問い合わせ、切り替えを行う旨の回答が得られた場合に切り替えの要求を管理サーバ30へ送信する。この要求を受信した管理サーバ30の切替処理部31cは、その時点の日時に関する情報を取得して、記憶部32に記憶されたオペレータDB83を参照し、切り替え先となるオペレータを決定する。またこのときに管理サーバ30は、オペレータが対応可能な言語を考慮して、切り替え先のオペレータを決定してもよい。切替処理部31cは、決定したオペレータが利用するオペレータ端末70に有人応答を行う指示を送信する。以後、管理サーバ30は、スマートフォン40とオペレータ端末70との間で、ユーザのリクエスト及びオペレータの応答に係るテキスト情報を中継する。 When the switching processing unit 31c cannot respond to the request given by the translation server 60, the switching processing unit 31c transmits an instruction prompting the switching to the manned response to the smartphone 40 that is the transmission source of this request. Upon receiving this instruction, the smartphone 40 inquires the user whether or not to switch from the chatbot to the manned response, and when a reply to the effect of switching is obtained, sends a switching request to the management server 30. Upon receiving this request, the switching processing unit 31c of the management server 30 acquires information on the date and time at that time, refers to the operator DB 83 stored in the storage unit 32, and determines the operator to be switched to. At this time, the management server 30 may determine the operator to switch to in consideration of the languages that the operator can handle. The switching processing unit 31c transmits an instruction to perform a manned response to the operator terminal 70 used by the determined operator. After that, the management server 30 relays text information related to the user's request and the operator's response between the smartphone 40 and the operator terminal 70.
 学習処理部31dは、意図識別モデル81を学習(再学習)する処理を行う。学習処理部31dは、例えば切替処理部31cによる上述の切り替えが行われ、最終的にオペレータによる有人応答が完了した後、オペレータに対して教師データの入力を求める。このときに学習処理部31dは、応答処理部31bで応答することができなかったユーザのリクエストに係るテキスト情報をオペレータ端末70に表示し、このテキスト情報に対して実際にオペレータが判断したリクエストの意図の入力を求める。学習処理部31dは、ユーザのリクエストのテキスト情報と、これに対してオペレータにより入力された意図とを組み合わせた情報を教師データとしてオペレータ端末70から取得し、記憶部32に蓄積する。学習処理部31dは、所定量の教師データが蓄積された場合、又は、教師データを取得する都度、新たに記憶された教師データを用いて記憶部32に記憶された意図識別モデル81を再学習する。なお意図識別モデル81の再学習は別のサーバ装置にて行い、管理サーバ30は教師データを蓄積するのみであってもよい。 The learning processing unit 31d performs a process of learning (re-learning) the intention identification model 81. The learning processing unit 31d requests the operator to input teacher data after, for example, the switching processing unit 31c performs the above-mentioned switching and finally the manned response by the operator is completed. At this time, the learning processing unit 31d displays the text information related to the user's request that the response processing unit 31b could not respond to on the operator terminal 70, and the request that the operator actually determines with respect to this text information. Ask for intent input. The learning processing unit 31d acquires information obtained by combining the text information of the user's request and the intention input by the operator as teacher data from the operator terminal 70 and stores it in the storage unit 32. The learning processing unit 31d relearns the intention identification model 81 stored in the storage unit 32 using the newly stored teacher data when a predetermined amount of teacher data is accumulated or each time the teacher data is acquired. To do. The re-learning of the intention identification model 81 may be performed by another server device, and the management server 30 may only store the teacher data.
 図11は、本実施の形態に係るスマートフォン40の構成を示すブロック図である。本実施の形態に係るスマートフォン40は、処理部(プロセッサ)41、記憶部(ストレージ)42、通信部(トランシーバ)43、タッチパネル44及びカメラ45等を備えて構成されている。処理部41は、CPU又はMPU等の演算処理装置を用いて構成されている。処理部41は、記憶部42に記憶されたプログラム42aを読み出して実行することにより、管理サーバ30へアクセスする処理及びチャットを実現するための処理等の種々の処理を行う。 FIG. 11 is a block diagram showing the configuration of the smartphone 40 according to the present embodiment. The smartphone 40 according to the present embodiment includes a processing unit (processor) 41, a storage unit (storage) 42, a communication unit (transceiver) 43, a touch panel 44, a camera 45, and the like. The processing unit 41 is configured by using an arithmetic processing unit such as a CPU or MPU. The processing unit 41 reads and executes the program 42a stored in the storage unit 42 to perform various processing such as a processing for accessing the management server 30 and a processing for realizing a chat.
 記憶部32は、例えばフラッシュメモリなどの不揮発性のメモリ素子を用いて構成されている。記憶部42は、処理部41が実行する各種のプログラム、及び、処理部41の処理に必要な各種のデータを記憶する。本実施の形態において記憶部42は、処理部41が実行するプログラム42aを記憶している。プログラム42aは、例えばスマートフォン40の製造段階において記憶部42に書き込まれてもよい。例えばプログラム42aは、遠隔のサーバ装置等が配信するものをスマートフォン40が通信にて取得してもよい。例えばプログラム42aは、メモリカード又は光ディスク等の記録媒体に記録されたプログラム42aをスマートフォン40が読み出して記憶部42に記憶してもよい。例えばプログラム42aは、記録媒体に記録されたものを書込装置が読み出してスマートフォン40の記憶部42に書き込んでもよい。プログラム42aは、ネットワークを介した配信の態様で提供されてもよく、記録媒体に記録された態様で提供されてもよい。 The storage unit 32 is configured by using a non-volatile memory element such as a flash memory. The storage unit 42 stores various programs executed by the processing unit 41 and various data required for processing by the processing unit 41. In the present embodiment, the storage unit 42 stores the program 42a executed by the processing unit 41. The program 42a may be written to the storage unit 42, for example, at the manufacturing stage of the smartphone 40. For example, in the program 42a, the smartphone 40 may acquire what is distributed by a remote server device or the like by communication. For example, in the program 42a, the smartphone 40 may read the program 42a recorded on a recording medium such as a memory card or an optical disk and store it in the storage unit 42. For example, in the program 42a, the writing device may read out what is recorded on the recording medium and write it in the storage unit 42 of the smartphone 40. The program 42a may be provided in a mode of distribution via a network, or may be provided in a mode recorded on a recording medium.
 通信部43は、インターネット、無線LAN及び携帯電話通信網等を含むネットワークNを介して、種々の装置との間で通信を行うことができる。本実施の形態においては、通信部43は、ネットワークNを介して翻訳サーバ60との通信を行う。通信部43は、処理部41から与えられたデータを翻訳サーバ60へ送信すると共に、翻訳サーバ60から受信したデータを処理部41へ与える。 The communication unit 43 can communicate with various devices via the network N including the Internet, wireless LAN, mobile phone communication network, and the like. In the present embodiment, the communication unit 43 communicates with the translation server 60 via the network N. The communication unit 43 transmits the data given by the processing unit 41 to the translation server 60, and gives the data received from the translation server 60 to the processing unit 41.
 タッチパネル44は、スマートフォン40が備えるユーザインタフェースの1つであり、表示部及び入力部を有している。タッチパネル44の表示部は、液晶ディスプレイ等を用いて構成されており、処理部41の処理に基づいて種々の画像及び文字等を表示する。タッチパネル44の入力部は、ユーザによる接触を検知するセンサが表示部の表面に設けられたものであり、表示部に表示された画像等に対するタッチ(タップ)操作等を入力として受け付け、受け付けた操作を処理部41へ通知する。 The touch panel 44 is one of the user interfaces included in the smartphone 40, and has a display unit and an input unit. The display unit of the touch panel 44 is configured by using a liquid crystal display or the like, and displays various images, characters, and the like based on the processing of the processing unit 41. The input unit of the touch panel 44 is provided with a sensor for detecting contact by the user on the surface of the display unit, and receives and accepts a touch (tap) operation or the like for an image or the like displayed on the display unit as input. Is notified to the processing unit 41.
 カメラ45は、周辺の撮像を行い、撮像した画像情報を処理部41へ与える。本実施の形態においてスマートフォン40のカメラ45は、スマートスピーカ20のタッチパネル25に表示されたアクセス用情報20aの撮像に用いられる。 The camera 45 takes an image of the surroundings and gives the captured image information to the processing unit 41. In the present embodiment, the camera 45 of the smartphone 40 is used for capturing the access information 20a displayed on the touch panel 25 of the smart speaker 20.
 また本実施の形態に係るスマートフォン40は、記憶部42に記憶されたプログラム42aを処理部41が読み出して実行することにより、アクセス用情報取得部41a及びチャット処理部41b等がソフトウェア的な機能ブロックとして実現される。アクセス用情報取得部41aは、スマートスピーカ20がタッチパネル25に表示する二次元コードのアクセス用情報20aを取得する処理を行う。アクセス用情報取得部41aは、例えばスマートスピーカ20に表示されたアクセス用情報20aの撮像を促すメッセージをタッチパネル44に表示し、カメラ45にて撮像されたアクセス用情報20aの画像を取得する。アクセス用情報取得部41aは、取得した画像を基に、アクセス用情報20aに含まれるチャットボットへのアクセス用のURL、並びに、スマートスピーカ20が設置された宿泊施設及び客室の識別情報等を取得する。 Further, in the smartphone 40 according to the present embodiment, the processing unit 41 reads and executes the program 42a stored in the storage unit 42, so that the access information acquisition unit 41a, the chat processing unit 41b, and the like are software-like functional blocks. Is realized as. The access information acquisition unit 41a performs a process of acquiring the access information 20a of the two-dimensional code displayed on the touch panel 25 by the smart speaker 20. For example, the access information acquisition unit 41a displays a message prompting the image of the access information 20a displayed on the smart speaker 20 on the touch panel 44, and acquires an image of the access information 20a imaged by the camera 45. Based on the acquired image, the access information acquisition unit 41a acquires the URL for accessing the chatbot included in the access information 20a, the identification information of the accommodation facility and the guest room where the smart speaker 20 is installed, and the like. To do.
 チャット処理部41bは、アクセス用情報取得部41aが取得したアクセス用情報に基づいて、チャットボットを提供する管理サーバ30へのアクセスを(翻訳サーバ60を介して)行う。チャット処理部41bは、図4に示す言語の選択ボックスを表示してチャットに用いる言語の選択を受け付けた後、図5に示すチャット画面を表示する。チャット処理部41bは、ユーザが入力したリクエストのテキスト情報を取得し、取得したテキスト情報を翻訳サーバ60を介して管理サーバ30へ送信する。またチャット処理部41bは、翻訳サーバ60を介して管理サーバ30から送信された応答のテキスト情報を受信し、受信したテキスト情報をチャット画面に表示する。なおチャット処理部41bは、チャットの相手がチャットボットによる自動応答であるか、オペレータの有人応答であるかに関わらず、同様の処理でチャットを進めることができる。 The chat processing unit 41b accesses the management server 30 that provides the chatbot (via the translation server 60) based on the access information acquired by the access information acquisition unit 41a. The chat processing unit 41b displays the language selection box shown in FIG. 4 and accepts the selection of the language used for chat, and then displays the chat screen shown in FIG. The chat processing unit 41b acquires the text information of the request input by the user, and transmits the acquired text information to the management server 30 via the translation server 60. Further, the chat processing unit 41b receives the text information of the response transmitted from the management server 30 via the translation server 60, and displays the received text information on the chat screen. The chat processing unit 41b can proceed with the chat by the same processing regardless of whether the chat partner is an automatic response by the chatbot or a manned response of the operator.
<応答切替処理>
 本実施の形態に係る情報提供システムでは、スマートスピーカ20を利用した音声応答、スマートフォン40を利用したチャットボットによる自動応答、及び、スマートフォン40を利用したオペレータによる有人応答の三種類の情報提供方法をユーザに対して提供している。また本実施の形態に係る情報提供システムでは、スマートスピーカ20の音声応答からチャットボットの自動応答への切り替え、及び、チャットボットの自動応答からオペレータの有人応答への切り替えの判断を管理サーバ30が行い、ユーザに情報提供方法の切り替えを促すことで、円滑な情報提供方法の切り替えを実現している。
<Response switching process>
In the information providing system according to the present embodiment, three types of information providing methods are provided: a voice response using the smart speaker 20, an automatic response by a chatbot using the smartphone 40, and a manned response by the operator using the smartphone 40. It is provided to users. Further, in the information providing system according to the present embodiment, the management server 30 determines the determination of switching from the voice response of the smart speaker 20 to the automatic response of the chatbot and switching from the automatic response of the chatbot to the manned response of the operator. By doing this and encouraging the user to switch the information provision method, smooth switching of the information provision method is realized.
(1)スマートスピーカ20の音声応答からチャットボットの自動応答への切り替え
 例えば、宿泊施設に宿泊するユーザは、宿泊施設に関する質問又は要求等を、客室に設置されたスマートスピーカ20を利用して音声入力によりリクエストする。スマートスピーカ20は入力されたリクエストに係る音声情報を音声処理サーバ50へ送信し、音声処理サーバ50は音声情報をテキスト情報に変換して管理サーバ30へ送信する。
(1) Switching from the voice response of the smart speaker 20 to the automatic response of the chatbot For example, a user staying at an accommodation facility asks a question or request regarding the accommodation facility by using the smart speaker 20 installed in the guest room. Request by input. The smart speaker 20 transmits the voice information related to the input request to the voice processing server 50, and the voice processing server 50 converts the voice information into text information and sends it to the management server 30.
 なお音声処理サーバ50は、入力された音声情報が日本語であれば日本語のテキスト情報を管理サーバ30へ送信し、英語であれば英語のテキスト情報を管理サーバ30へ送信する。音声処理サーバ50が非対応の言語による音声情報が入力された場合、音声処理サーバ50は、音声情報からテキスト情報への変換ができない旨の通知を管理サーバ30へ送信する。 If the input voice information is Japanese, the voice processing server 50 transmits Japanese text information to the management server 30, and if it is English, the voice processing server 50 transmits English text information to the management server 30. When voice information in a language that the voice processing server 50 does not support is input, the voice processing server 50 sends a notification to the management server 30 that the voice information cannot be converted into text information.
 管理サーバ30は、音声処理サーバ50から受信したテキスト情報を基に、意図識別モデル81及び応答情報DB82を用いてユーザのリクエストに対する応答情報を取得する。管理サーバ30はこの応答情報をテキスト情報として音声処理サーバ50へ送信し、音声処理サーバ50はテキスト情報を音声情報に変換してスマートスピーカ20へ送信する。スマートスピーカ20は、音声処理サーバ50からの音声情報を受信し、受信した音声情報をユーザのリクエストに対する応答としてスピーカ27から音声出力する。 The management server 30 acquires the response information to the user's request using the intention identification model 81 and the response information DB 82 based on the text information received from the voice processing server 50. The management server 30 transmits this response information as text information to the voice processing server 50, and the voice processing server 50 converts the text information into voice information and transmits it to the smart speaker 20. The smart speaker 20 receives the voice information from the voice processing server 50, and outputs the received voice information from the speaker 27 as a response to the user's request.
 また管理サーバ30は、音声処理サーバ50から受信したテキスト情報に基づいて、意図識別モデル81及び応答情報DB82を用いても応答情報を取得することができない場合、音声応答からチャットボットの自動応答への切り替えを促す切替指示を、音声処理サーバ50を介してスマートスピーカ20へ送信する。このときに管理サーバ30は、チャットボットにアクセスするためのURL等の情報と、スマートスピーカ20が設置された宿泊施設及び客室の識別情報とを含む二次元コードのアクセス用情報20aを生成して切替指示に付して送信する。 Further, when the management server 30 cannot acquire the response information even by using the intention identification model 81 and the response information DB 82 based on the text information received from the voice processing server 50, the management server 30 changes from the voice response to the automatic response of the chatbot. A switching instruction prompting the switching is transmitted to the smart speaker 20 via the voice processing server 50. At this time, the management server 30 generates the access information 20a of the two-dimensional code including the information such as the URL for accessing the chatbot and the identification information of the accommodation facility and the guest room where the smart speaker 20 is installed. Send along with the switching instruction.
 なお管理サーバ30が応答情報を取得できない要因は複数存在し得る。例えば、スマートスピーカ20へ入力された音声が不鮮明であったため音声処理サーバ50が音声情報からテキスト情報への変換を行うことができない事が要因として挙げられる。入力された音声情報が、音声処理サーバ50の非対応言語である場合も同様である。これらの場合、音声処理サーバ50から管理サーバ30へ音声処理が不可能であった旨の通知が与えられ、この通知により管理サーバ30は応答情報を取得できないと判定することができる。 There may be multiple factors that prevent the management server 30 from acquiring response information. For example, the reason is that the voice processing server 50 cannot convert the voice information into the text information because the voice input to the smart speaker 20 is unclear. The same applies when the input voice information is in a language that is not supported by the voice processing server 50. In these cases, the voice processing server 50 gives a notification to the management server 30 that the voice processing was not possible, and it can be determined that the management server 30 cannot acquire the response information by this notification.
 また例えば、意図識別モデル81がリクエストの意図を識別できない事が要因として挙げられる。意図識別モデル81が、入力されたリクエストに対して、識別したリクエストの意図と、この識別結果の確かさとを出力する構成である場合、出力された確かさが所定の閾値を超えない場合に、管理サーバ30は意図識別モデル81が意図を識別できないと判定し、応答情報を取得できないと判定することができる。 Another factor is that the intention identification model 81 cannot identify the intention of the request. When the intention identification model 81 is configured to output the intention of the identified request and the certainty of the identification result for the input request, when the output certainty does not exceed a predetermined threshold value. The management server 30 can determine that the intention identification model 81 cannot identify the intention and cannot acquire the response information.
 また例えば、意図識別モデル81が識別したリクエストの意図及びこれに対応する応答情報が、応答情報DB82に登録されていない事が要因として挙げられる。管理サーバ30は、意図識別モデル81が出力したリクエストの意図に基づいて応答情報DB82を検索し、該当する項目が応答情報DB82に存在しない場合に、応答情報を取得できないと判定することができる。 Further, for example, the reason is that the intention of the request identified by the intention identification model 81 and the response information corresponding thereto are not registered in the response information DB 82. The management server 30 searches the response information DB 82 based on the intention of the request output by the intention identification model 81, and can determine that the response information cannot be acquired when the corresponding item does not exist in the response information DB 82.
 なお管理サーバ30がリクエストに対応する応答情報を取得できないと判定する方法は、上記のものに限らず、様々な方法が採用され得る。 The method of determining that the management server 30 cannot acquire the response information corresponding to the request is not limited to the above method, and various methods can be adopted.
 管理サーバ30からの切替指示を音声処理サーバ50を介して受信したスマートスピーカ20は、チャットボットの自動応答への切り替えを行うか否かを問い合わせるメッセージ等をスピーカ27から音声出力するか又はタッチパネル25に表示し、ユーザによる切り替えの可否を問い合わせる。音声入力またはタッチパネルに対する操作等によりユーザから切り替えを肯定する応答が得られた場合、スマートスピーカ20は、切替指示に付されたアクセス用情報20aの二次元コードをタッチパネル25に表示する。 The smart speaker 20 that receives the switching instruction from the management server 30 via the voice processing server 50 outputs a message or the like inquiring whether to switch to the automatic response of the chatbot by voice from the speaker 27 or the touch panel 25. Is displayed in, and the user inquires whether or not switching is possible. When a response affirming the switching is obtained from the user by voice input or an operation on the touch panel, the smart speaker 20 displays the two-dimensional code of the access information 20a attached to the switching instruction on the touch panel 25.
 スマートスピーカ20のタッチパネル25にアクセス用情報20aが表示された場合、ユーザは自身のスマートフォン40にてアクセス用情報20aを読み取り、管理サーバ30により実現されるチャットボットへのアクセスをスマートフォン40にて行うことができる。スマートフォン40は、図4に示した言語選択ボックスを表示してユーザにチャットで用いる言語を選択させる。スマートフォン40は、選択された言語によるチャット画面を表示する。このチャット画面においてスマートフォン40は、ユーザによるリクエストのテキスト入力を受け付けると共に、管理サーバ30による応答のテキスト出力を行う。なお本実施の形態においてスマートフォン40は、二次元コードを読み取るアプリケーションプログラム等にてアクセス用情報20aの読み取りを行い、読み取ったアクセス用情報20aに含まれるURLへのアクセスをブラウザ等のプログラムにて行うものとする。チャット画面はブラウザ上に表示され、チャット画面を表示するための処理は、管理サーバ30及びスマートフォン40の協働により行われる。 When the access information 20a is displayed on the touch panel 25 of the smart speaker 20, the user reads the access information 20a on his / her smartphone 40 and accesses the chatbot realized by the management server 30 on the smartphone 40. be able to. The smartphone 40 displays the language selection box shown in FIG. 4 and allows the user to select the language to be used in the chat. The smartphone 40 displays a chat screen in the selected language. On this chat screen, the smartphone 40 accepts the text input of the request by the user and outputs the text of the response by the management server 30. In the present embodiment, the smartphone 40 reads the access information 20a by an application program or the like that reads a two-dimensional code, and accesses the URL included in the read access information 20a by a program such as a browser. It shall be. The chat screen is displayed on the browser, and the process for displaying the chat screen is performed by the cooperation of the management server 30 and the smartphone 40.
 なお本実施の形態においては、チャットに用いる言語の選択をユーザが行う構成とするが、これに限るものではない。管理サーバ30は、例えばアクセス用情報20aに含まれる宿泊施設及び客室の識別情報に基づいて、言語を自動的に選択してもよい。例えば管理サーバ30は、スマートフォン40から宿泊施設及び客室の識別情報を取得し、宿泊施設の宿泊客を管理する別のサーバ装置等から、識別情報にて識別される宿泊施設の客室に宿泊しているユーザの国籍等の情報を取得することで、チャットに用いる言語を選択することができる。 In the present embodiment, the user selects the language to be used for chat, but the present invention is not limited to this. The management server 30 may automatically select a language based on, for example, the identification information of the accommodation facility and the guest room included in the access information 20a. For example, the management server 30 acquires the identification information of the accommodation facility and the guest room from the smartphone 40, and stays in the guest room of the accommodation facility identified by the identification information from another server device or the like that manages the guests of the accommodation facility. By acquiring information such as the nationality of the user, the language used for chatting can be selected.
 図12は、本実施の形態に係るスマートスピーカ20が行う処理の手順を示すフローチャートである。本実施の形態に係るスマートスピーカ20の処理部21は、マイク26に対するユーザによるリクエストの音声入力がなされたか否かを判定する(ステップS1)。音声入力がなされていない場合(S1:NO)、処理部21は、リクエストの音声入力がなされるまで待機する。音声入力がなされた場合(S1:YES)、処理部21は、マイク26にて入力された音声を、デジタルデータの音声情報として取得する(ステップS2)。処理部21は、取得した音声情報を、通信部23にて音声処理サーバ50へ送信する(ステップS3)。 FIG. 12 is a flowchart showing a procedure of processing performed by the smart speaker 20 according to the present embodiment. The processing unit 21 of the smart speaker 20 according to the present embodiment determines whether or not the voice input of the request by the user to the microphone 26 has been made (step S1). When no voice input is made (S1: NO), the processing unit 21 waits until the request voice input is made. When voice input is made (S1: YES), the processing unit 21 acquires the voice input by the microphone 26 as voice information of digital data (step S2). The processing unit 21 transmits the acquired voice information to the voice processing server 50 by the communication unit 23 (step S3).
 その後、処理部21は、送信したリクエストに対する応答を受信したか否かを判定する(ステップS4)。応答を受信していない場合(S4:NO)、処理部21は、応答を受信するまで待機する。ここでスマートスピーカ20が受信する応答は、リクエストに対する音声の応答情報、又は、チャットボットへの切替指示である。応答を受信した場合(S4:YES)、処理部21は、受信した応答がチャットボットへの切替指示であるか否かを判定する(ステップS5)。切替指示でない場合(S5:NO)、処理部21は、受信した応答の音声情報をスピーカ27から音声出力し(ステップS6)、処理を終了する。 After that, the processing unit 21 determines whether or not a response to the transmitted request has been received (step S4). If no response has been received (S4: NO), the processing unit 21 waits until the response is received. Here, the response received by the smart speaker 20 is voice response information to the request or an instruction to switch to the chatbot. When the response is received (S4: YES), the processing unit 21 determines whether or not the received response is an instruction to switch to the chatbot (step S5). If it is not a switching instruction (S5: NO), the processing unit 21 outputs the voice information of the received response from the speaker 27 (step S6), and ends the processing.
 受信した応答が切替指示である場合(S5:YES)、処理部21は、切り替えの可否を問い合わせるメッセージ等をスピーカ27から音声出力か又はタッチパネル25に表示することにより、切り替えの可否をユーザに問い合わせる(ステップS7)。処理部21は、この問い合わせに対して切り替えを肯定する回答が得られたか否かを判定する(ステップS8)。切り替えを肯定する回答が得られた場合(S8:YES)、処理部21は、切替指示と共に管理サーバ30から与えられる二次元コードのアクセス用情報20aをタッチパネル25に表示し(ステップS9)、処理を終了する。切り替えを肯定する回答が得られなかった場合(S8:NO)、処理部21は、アクセス用情報20aを表示することなく、処理を終了する。 When the received response is a switching instruction (S5: YES), the processing unit 21 inquires the user whether or not the switching is possible by outputting a voice from the speaker 27 or displaying a message or the like inquiring whether or not the switching is possible on the touch panel 25. (Step S7). The processing unit 21 determines whether or not an answer affirming the switching has been obtained in response to this inquiry (step S8). When an answer affirming the switching is obtained (S8: YES), the processing unit 21 displays the two-dimensional code access information 20a given from the management server 30 together with the switching instruction on the touch panel 25 (step S9), and processes the information. To finish. If no answer affirming the switching is obtained (S8: NO), the processing unit 21 ends the process without displaying the access information 20a.
 図13は、本実施の形態に係る管理サーバ30が行う切替制御処理の手順を示すフローチャートである。本実施の形態に係る管理サーバ30の処理部31のリクエスト取得部31aは、通信部33にて音声処理サーバ50からのテキスト情報によるリクエストを受信したか否かを判定する(ステップS21)。リクエストを受信していない場合(S21:NO)、リクエスト取得部31aは、リクエストを受信するまで待機する。 FIG. 13 is a flowchart showing the procedure of the switching control process performed by the management server 30 according to the present embodiment. The request acquisition unit 31a of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not the communication unit 33 has received the request based on the text information from the voice processing server 50 (step S21). If the request has not been received (S21: NO), the request acquisition unit 31a waits until the request is received.
 リクエストを受信した場合(S21:YES)、処理部31の応答処理部31bは、受信したリクエストのテキスト情報を記憶部32に記憶された意図識別モデル81へ入力する(ステップS22)。次いで応答処理部31bは、意図識別モデル81が出力するリクエストの意図の識別結果を取得する(ステップS23)。取得した識別結果に基づいて、応答処理部31bは、リクエストの意図が識別可能であったか否かを判定する(ステップS24)。リクエストの意図が識別可能であった場合(S24:YES)、応答処理部31bは、識別された意図に基づいて、記憶部32に記憶された応答情報DB82を参照する(ステップS25)。応答処理部31bは、応答情報DB82を参照して応答情報が得られたか否かを判定する(ステップS26)。応答情報が得られた場合(S26:YES)、応答処理部31bは、応答情報DB82から取得した応答情報を、音声処理サーバ50を介してリクエスト送信元のスマートスピーカ20へ送信し(ステップS27)、処理を終了する。 When the request is received (S21: YES), the response processing unit 31b of the processing unit 31 inputs the text information of the received request into the intention identification model 81 stored in the storage unit 32 (step S22). Next, the response processing unit 31b acquires the identification result of the intention of the request output by the intention identification model 81 (step S23). Based on the acquired identification result, the response processing unit 31b determines whether or not the intention of the request can be identified (step S24). When the intention of the request can be identified (S24: YES), the response processing unit 31b refers to the response information DB 82 stored in the storage unit 32 based on the identified intention (step S25). The response processing unit 31b refers to the response information DB 82 and determines whether or not the response information has been obtained (step S26). When the response information is obtained (S26: YES), the response processing unit 31b transmits the response information acquired from the response information DB 82 to the smart speaker 20 of the request transmission source via the voice processing server 50 (step S27). , End the process.
 意図識別モデル81にてリクエストの意図が識別不可能であった場合(S24:NO)、又は、応答情報DB82からリクエストの意図に対する応答情報が得られなかった場合(S26:NO)、処理部31の切替処理部31cは、チャットボットへアクセスするためのアクセス用情報20aを生成する(ステップS28)。切替処理部31cは、生成したアクセス用情報20aを含む切替指示を、音声処理サーバ50を介してリクエスト送信元のスマートスピーカ20へ送信し(ステップS29)、処理を終了する。 When the intention of the request cannot be identified by the intention identification model 81 (S24: NO), or when the response information for the intention of the request cannot be obtained from the response information DB 82 (S26: NO), the processing unit 31 The switching processing unit 31c of the above generates access information 20a for accessing the chatbot (step S28). The switching processing unit 31c transmits a switching instruction including the generated access information 20a to the smart speaker 20 of the request transmission source via the voice processing server 50 (step S29), and ends the processing.
 図14は、本実施の形態に係るスマートフォン40が行うチャット開始処理の手順を示すフローチャートである。本実施の形態に係るスマートフォン40の処理部41は、例えばユーザの操作に応じてアプリケーションプログラムを起動し、二次元コードの読み取りを行うモードへと移行する(ステップS41)。本モードにおいて処理部41のアクセス用情報取得部41aは、カメラ45にてアクセス用情報20aを撮像することにより、アクセス用情報20aを取得する(ステップS42)。アクセス用情報取得部41aは、取得したアクセス用情報20aに含まれるURL及び識別情報を取得する(ステップS43)。 FIG. 14 is a flowchart showing a procedure of chat start processing performed by the smartphone 40 according to the present embodiment. The processing unit 41 of the smartphone 40 according to the present embodiment starts an application program in response to, for example, a user operation, and shifts to a mode for reading a two-dimensional code (step S41). In this mode, the access information acquisition unit 41a of the processing unit 41 acquires the access information 20a by capturing the access information 20a with the camera 45 (step S42). The access information acquisition unit 41a acquires the URL and the identification information included in the acquired access information 20a (step S43).
 次いで処理部41のチャット処理部41bは、取得したURL及び識別情報に基づいて管理サーバ30へのアクセスを行う(ステップS44)。なおこのときにスマートフォン40がアクセス用情報20aから取得した識別情報を管理サーバ30へ送信することにより、管理サーバ30は、いずれの宿泊施設及び客室のユーザからのアクセスであるかを判断することができる。チャット処理部41bは、言語の選択ボックスを表示して、チャットに用いる言語の選択を受け付ける(ステップS45)。チャット処理部41bは、選択された言語により、管理サーバ30が提供するチャットボットとのチャット処理を開始し(ステップS46)、処理を終了する。 Next, the chat processing unit 41b of the processing unit 41 accesses the management server 30 based on the acquired URL and the identification information (step S44). At this time, by transmitting the identification information acquired by the smartphone 40 from the access information 20a to the management server 30, the management server 30 can determine which accommodation facility or guest room user is accessing. it can. The chat processing unit 41b displays a language selection box and accepts the selection of the language used for chat (step S45). The chat processing unit 41b starts a chat process with the chatbot provided by the management server 30 in the selected language (step S46), and ends the process.
(2)チャットボットの自動応答からオペレータの有人応答への切り替え
 スマートフォン40を利用してチャットを開始したユーザは、テキスト入力によりリクエストの入力を行うことができる。入力されたリクエストのテキスト情報はスマートフォン40から翻訳サーバ60へ送信され、必要に応じてテキスト情報の翻訳が翻訳サーバ60にて行われる。翻訳サーバ60は、翻訳したテキスト情報又は翻訳が不要なテキスト情報を管理サーバ30へ送信する。なお、テキスト情報の翻訳が必要でない場合には、スマートフォン40から管理サーバ30へ直接的にテキスト情報が送信されてもよい。
(2) Switching from the automatic response of the chatbot to the manned response of the operator The user who has started the chat using the smartphone 40 can input the request by text input. The text information of the input request is transmitted from the smartphone 40 to the translation server 60, and the text information is translated by the translation server 60 as necessary. The translation server 60 transmits the translated text information or the text information that does not need to be translated to the management server 30. If translation of the text information is not required, the text information may be directly transmitted from the smartphone 40 to the management server 30.
 管理サーバ30は、翻訳サーバ60から受信したテキスト情報を基に、意図識別モデル81及び応答情報DB82を用いてユーザのリクエストに対する応答情報を取得する。管理サーバ30はこの応答情報をテキスト情報として翻訳サーバ60へ送信し、翻訳サーバ60は必要に応じてテキスト情報を翻訳する。翻訳サーバ60は、翻訳したテキスト情報又は翻訳が不要なテキスト情報を、スマートフォン40へ送信する。スマートフォン40は、翻訳サーバ60からのテキスト情報を受信し、受信したテキスト情報をユーザのリクエストに対する応答としてチャット画面に表示する。 The management server 30 acquires the response information to the user's request using the intention identification model 81 and the response information DB 82 based on the text information received from the translation server 60. The management server 30 transmits this response information as text information to the translation server 60, and the translation server 60 translates the text information as needed. The translation server 60 transmits the translated text information or the text information that does not need to be translated to the smartphone 40. The smartphone 40 receives the text information from the translation server 60 and displays the received text information on the chat screen as a response to the user's request.
 また管理サーバ30は、翻訳サーバ60から受信したテキスト情報に基づいて、意図識別モデル81及び応答情報DB82を用いても応答情報を取得することができない場合、チャットボットの自動応答からオペレータの有人応答への切り替えを促す切替指示を、翻訳サーバ60を介してスマートフォン40へ送信する。 Further, when the management server 30 cannot acquire the response information even by using the intention identification model 81 and the response information DB 82 based on the text information received from the translation server 60, the management server 30 makes a manned response of the operator from the automatic response of the chatbot. A switching instruction prompting the user to switch to is transmitted to the smartphone 40 via the translation server 60.
 なお管理サーバ30が応答情報を取得できない要因は複数存在し得る。例えば、意図識別モデル81がリクエストの意図を識別できない事が要因として挙げられる。また例えば、意図識別モデル81が識別したリクエストの意図及びこれに対応する応答情報が、応答情報DB82に登録されていない事が要因として挙げられる。管理サーバ30がリクエストに対応する応答情報を取得できないと判定する方法は、様々な方法が採用され得る。 There may be multiple factors that prevent the management server 30 from acquiring response information. For example, the fact that the intention identification model 81 cannot identify the intention of the request can be mentioned as a factor. Further, for example, the reason is that the intention of the request identified by the intention identification model 81 and the response information corresponding thereto are not registered in the response information DB 82. Various methods can be adopted as a method for determining that the management server 30 cannot acquire the response information corresponding to the request.
 管理サーバ30からの切替指示を翻訳サーバ60を介して受信したスマートフォン40は、オペレータの有人応答への切り替えを行うか否かを問い合わせるメッセージ等をチャット画面に表示し、ユーザによる切り替えの可否を問い合わせる。この問い合わせに対してユーザは、例えばチャット画面に設けられた切替アイコンへのタッチ操作を行うことで、有人応答への切り替えを行うことができる。ユーザから切り替えを肯定する回答が得られた場合、スマートフォン40は、有人応答への切り替えを行う要求を、翻訳サーバ60を介して管理サーバ30へ送信する。 The smartphone 40, which receives the switching instruction from the management server 30 via the translation server 60, displays a message or the like inquiring whether or not to switch to the manned response of the operator on the chat screen, and inquires whether or not the user can switch. .. In response to this inquiry, the user can switch to a manned response by, for example, touching the switching icon provided on the chat screen. When the user gives an affirmative answer to the switch, the smartphone 40 transmits a request for switching to the manned response to the management server 30 via the translation server 60.
 スマートフォン40からの切り替えの要求を受信した管理サーバ30は、その時点における日時情報を取得し、取得した日時情報に基づいて記憶部32に記憶されたオペレータDB83を参照する。管理サーバ30は、日時情報及びオペレータDB83に基づいて有人応答を依頼するオペレータを決定し、決定したオペレータが使用するオペレータ端末70に有人応答を行う指示を送信する。またこのときに管理サーバ30は、オペレータが対応可能な言語を考慮して、切り替え先のオペレータを決定してもよい。 The management server 30 that has received the switching request from the smartphone 40 acquires the date and time information at that time, and refers to the operator DB 83 stored in the storage unit 32 based on the acquired date and time information. The management server 30 determines an operator requesting a manned response based on the date and time information and the operator DB 83, and transmits an instruction to perform the manned response to the operator terminal 70 used by the determined operator. At this time, the management server 30 may determine the operator to switch to in consideration of the languages that the operator can handle.
 また管理サーバ30は、オペレータが有人応答の対応を開始する前に、これまでの経緯及びユーザに関する情報等を送信してオペレータ端末70に表示させる。これまでの経緯は、例えばスマートスピーカ20にて入出力された音声の内容を示すテキスト情報、及び、チャットボットの自動応答にて入出力されたテキスト情報等が含まれ得る。管理サーバ30は、これらの情報を履歴情報として記憶部32に記憶している。またユーザに関する情報は、例えばユーザの性別、年齢及び国籍等の情報とすることができ、宿泊施設の宿泊客に関する情報を記憶する他のサーバ装置等から取得される。これらの情報を予めオペレータ端末70に表示することで、有人応答を行うオペレータが円滑にユーザとのチャットを開始することができる。 Further, the management server 30 transmits information such as the history and the user so far and displays it on the operator terminal 70 before the operator starts to respond to the manned response. The history so far may include, for example, text information indicating the content of the voice input / output by the smart speaker 20, text information input / output by the automatic response of the chatbot, and the like. The management server 30 stores this information as history information in the storage unit 32. Further, the information about the user can be, for example, information such as the gender, age, and nationality of the user, and is acquired from another server device or the like that stores information about the guest of the accommodation facility. By displaying this information on the operator terminal 70 in advance, the operator who makes a manned response can smoothly start chatting with the user.
 以後、管理サーバ30は、スマートフォン40及びオペレータ端末70の間のテキスト情報の送受信を中継することによって、ユーザ及びオペレータのテキスト情報によるチャットを成立させる。 After that, the management server 30 relays the transmission and reception of text information between the smartphone 40 and the operator terminal 70 to establish a chat using the text information of the user and the operator.
 また本実施の形態においては、オペレータによる有人応答が終了した後、管理サーバ30は、有人応答を行ったオペレータに対して、意図識別モデル81を再学習させるための教師データの作成を依頼する。管理サーバ30は、例えばユーザへの有人応答を終えたオペレータ端末70に、このユーザのリクエストの意図がどのようなものであったかを問い合わせる。このときに管理サーバ30は、有人応答の前に音声応答又はチャットボットの自動応答においてユーザが入力したテキスト情報をオペレータ端末70へ送信し、これらのテキスト情報をオペレータ端末70に表示させてもよい。オペレータ端末70に表示されたメッセージ等に基づいて、オペレータは、有人応答を行ったユーザのリクエストの意図がどのようなものであったかを入力する。オペレータ端末70は、オペレータにより入力された情報を管理サーバ30へ送信する。管理サーバ30は、オペレータ端末70からの情報を受信し、ユーザの入力に係るリクエストのテキスト情報と、オペレータにより入力されたリクエストの意図とを対応付けた教師データを作成して、記憶部32に記憶する。なお、オペレータ端末70にてオペレータにより入力された情報は、意図識別モデル81の学習に用いるのみでなく、応答情報DB82への応答情報の追加又は修正等に用いることもできる。 Further, in the present embodiment, after the manned response by the operator is completed, the management server 30 requests the operator who made the manned response to create teacher data for re-learning the intention identification model 81. The management server 30 inquires, for example, the operator terminal 70 that has completed the manned response to the user what the intention of the user's request was. At this time, the management server 30 may transmit the text information input by the user in the voice response or the automatic response of the chatbot before the manned response to the operator terminal 70, and display the text information on the operator terminal 70. .. Based on the message or the like displayed on the operator terminal 70, the operator inputs what the intention of the request of the user who made the manned response was. The operator terminal 70 transmits the information input by the operator to the management server 30. The management server 30 receives the information from the operator terminal 70, creates teacher data in which the text information of the request related to the user's input and the intention of the request input by the operator are associated with each other, and stores the teacher data in the storage unit 32. Remember. The information input by the operator at the operator terminal 70 can be used not only for learning the intention identification model 81, but also for adding or modifying the response information to the response information DB 82.
 図15は、本実施の形態に係るスマートフォン40が行うチャット処理の手順を示すフローチャートである。本実施の形態に係るスマートフォン40の処理部41のチャット処理部41bは、タッチパネル44に対する操作に基づいて、ユーザによるリクエストの入力を受け付ける(ステップS51)。チャット処理部41bは、受け付けたリクエストのテキスト情報を、通信部43にて翻訳サーバ60へ送信する(ステップS52)。 FIG. 15 is a flowchart showing a procedure of chat processing performed by the smartphone 40 according to the present embodiment. The chat processing unit 41b of the processing unit 41 of the smartphone 40 according to the present embodiment accepts the input of the request by the user based on the operation on the touch panel 44 (step S51). The chat processing unit 41b transmits the text information of the received request to the translation server 60 by the communication unit 43 (step S52).
 その後、チャット処理部41bは、送信したリクエストに対する応答を受信したか否かを判定する(ステップS53)。応答を受信していない場合(S53:NO)、チャット処理部41bは、応答を受信するまで待機する。ここでスマートフォン40が受信する応答は、リクエストに対する応答のテキスト情報、又は、オペレータによる有人応答への切替指示である。応答を受信した場合(S53:YES)、チャット処理部41bは、受信した応答が有人応答への切替指示であるか否かを判定する(ステップS54)。切替指示でない場合(S54:NO)、チャット処理部41bは、受信した応答のテキスト情報をチャット画面に表示し(ステップS55)、処理を終了する。 After that, the chat processing unit 41b determines whether or not a response to the transmitted request has been received (step S53). If no response has been received (S53: NO), the chat processing unit 41b waits until the response is received. Here, the response received by the smartphone 40 is text information of the response to the request or an instruction to switch to the manned response by the operator. When the response is received (S53: YES), the chat processing unit 41b determines whether or not the received response is an instruction to switch to the manned response (step S54). If it is not a switching instruction (S54: NO), the chat processing unit 41b displays the text information of the received response on the chat screen (step S55), and ends the process.
 受信した応答が切替指示である場合(S54:YES)、チャット処理部41bは、切り替えの可否を問い合わせるメッセージを表示することにより、切り替えの可否をユーザに問い合わせる(ステップS56)。チャット処理部41bは、この問い合わせに対して切り替えを肯定する回答が得られたか否かを判定する(ステップS57)。切り替えを肯定する回答が得られた場合(S57:YES)、チャット処理部41bは、有人応答への切り替えの要求を、翻訳サーバ60を介して管理サーバ30へ送信し(ステップS58)、処理を終了する。切り替えを肯定する回答が得られなかった場合(S57:NO)、チャット処理部41bは、切り替えの要求を送信することなく、処理を終了する。 When the received response is a switching instruction (S54: YES), the chat processing unit 41b inquires the user whether or not the switching is possible by displaying a message inquiring whether or not the switching is possible (step S56). The chat processing unit 41b determines whether or not an answer affirming the switching has been obtained for this inquiry (step S57). When an answer affirming the switching is obtained (S57: YES), the chat processing unit 41b transmits a request for switching to the manned response to the management server 30 via the translation server 60 (step S58), and performs processing. finish. If no answer affirming the switch is obtained (S57: NO), the chat processing unit 41b ends the process without transmitting the switch request.
 図16は、本実施の形態に係る管理サーバ30が行う切替制御処理の手順を示すフローチャートである。本実施の形態に係る管理サーバ30の処理部31のリクエスト取得部31aは、通信部33にて翻訳サーバ60からのテキスト情報によるリクエストを受信したか否かを判定する(ステップS71)。リクエストを受信していない場合(S71:NO)、リクエスト取得部31aは、リクエストを受信するまで待機する。 FIG. 16 is a flowchart showing a procedure of switching control processing performed by the management server 30 according to the present embodiment. The request acquisition unit 31a of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not the communication unit 33 has received the request based on the text information from the translation server 60 (step S71). If the request has not been received (S71: NO), the request acquisition unit 31a waits until the request is received.
 リクエストを受信した場合(S71:YES)、処理部31の応答処理部31bは、受信したリクエストのテキスト情報を記憶部32に記憶された意図識別モデル81へ入力する(ステップS72)。次いで応答処理部31bは、意図識別モデル81が出力するリクエストの意図の識別結果を取得する(ステップS73)。取得した識別結果に基づいて、応答処理部31bは、リクエストの意図が識別可能であったか否かを判定する(ステップS74)。リクエストの意図が識別可能であった場合(S74:YES)、応答処理部31bは、識別された意図に基づいて、記憶部32に記憶された応答情報DB82を参照する(ステップS75)。応答処理部31bは、応答情報DB82を参照して応答情報が得られたか否かを判定する(ステップS76)。応答情報が得られた場合(S76:YES)、応答処理部31bは、応答情報DB82から取得した応答情報を、翻訳サーバ60を介してリクエスト送信元のスマートフォン40へ送信し(ステップS77)、処理を終了する。 When the request is received (S71: YES), the response processing unit 31b of the processing unit 31 inputs the text information of the received request into the intention identification model 81 stored in the storage unit 32 (step S72). Next, the response processing unit 31b acquires the identification result of the intention of the request output by the intention identification model 81 (step S73). Based on the acquired identification result, the response processing unit 31b determines whether or not the intention of the request can be identified (step S74). When the intention of the request can be identified (S74: YES), the response processing unit 31b refers to the response information DB 82 stored in the storage unit 32 based on the identified intention (step S75). The response processing unit 31b refers to the response information DB 82 and determines whether or not the response information has been obtained (step S76). When the response information is obtained (S76: YES), the response processing unit 31b transmits the response information acquired from the response information DB 82 to the request transmission source smartphone 40 via the translation server 60 (step S77), and processes the response information. To finish.
 意図識別モデル81にてリクエストの意図が識別不可能であった場合(S74:NO)、又は、応答情報DB82からリクエストの意図に対する応答情報が得られなかった場合(S76:NO)、処理部31の切替処理部31cは、チャットボットの自動応答からオペレータによる有人応答への切替指示を、音声処理サーバ50を介してリクエスト送信元のスマートフォン40へ送信し(ステップS78)、処理を終了する。 When the intention of the request cannot be identified by the intention identification model 81 (S74: NO), or when the response information for the intention of the request cannot be obtained from the response information DB 82 (S76: NO), the processing unit 31 The switching processing unit 31c of the above transmits a switching instruction from the automatic response of the chatbot to the manned response by the operator to the smartphone 40 of the request transmission source via the voice processing server 50 (step S78), and ends the processing.
 図17は、本実施の形態に係る管理サーバ30が行う有人応答への切替処理の手順を示すフローチャートである。本実施の形態に係る管理サーバ30の処理部31の切替処理部31cは、翻訳サーバ60を介してスマートフォン40からの有人応答への切り替え要求を受信したか否かを判定する(ステップS91)。有人応答への切り替えの要求を受信していない場合(S91:NO)、切替処理部31cは、要求を受信するまで待機する。 FIG. 17 is a flowchart showing the procedure of the switching process to the manned response performed by the management server 30 according to the present embodiment. The switching processing unit 31c of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not a request for switching to a manned response from the smartphone 40 has been received via the translation server 60 (step S91). When the request for switching to the manned response has not been received (S91: NO), the switching processing unit 31c waits until the request is received.
 有人応答への切り替えの要求を受信した場合(S91:YES)、切替処理部31cは、その時点での日時の情報を取得する(ステップS92)。なお日時の情報は、例えば管理サーバ30にて動作するオペレーティングシステム等から取得することができる。切替処理部31cは、取得した日時情報に基づいて、記憶部32に記憶されたオペレータDB83を参照する(ステップS93)。切替処理部31cは、日時情報及びオペレータDB83に基づいて、有人応答を行うオペレータを決定する(ステップS94)。切替処理部31cは、決定したオペレータが使用するオペレータ端末70に応答を行う指示を与える(ステップS95)。 When a request for switching to a manned response is received (S91: YES), the switching processing unit 31c acquires information on the date and time at that time (step S92). The date and time information can be obtained from, for example, an operating system running on the management server 30. The switching processing unit 31c refers to the operator DB 83 stored in the storage unit 32 based on the acquired date and time information (step S93). The switching processing unit 31c determines an operator to perform a manned response based on the date and time information and the operator DB 83 (step S94). The switching processing unit 31c gives an instruction to give a response to the operator terminal 70 used by the determined operator (step S95).
 その後、処理部41は、スマートフォン40からのユーザのリクエストをオペレータ端末70へ送信し、オペレータ端末70からのオペレータの応答をスマートフォン40へ送信することにより、リクエスト及び応答の中継を行う(ステップS96)。処理部41は、有人応答が終了したか否かを判定する(ステップS97)。有人応答が終了していない場合(S97:NO)、処理部41は、ステップS96へ処理を戻し、リクエスト及び応答の中継を継続して行う。 After that, the processing unit 41 transmits the user's request from the smartphone 40 to the operator terminal 70, and the operator's response from the operator terminal 70 is transmitted to the smartphone 40 to relay the request and the response (step S96). .. The processing unit 41 determines whether or not the manned response is completed (step S97). When the manned response is not completed (S97: NO), the processing unit 41 returns the processing to step S96 and continuously relays the request and the response.
 有人応答が終了した場合(S97:YES)、処理部41は、有人応答を行ったオペレータ端末70に対して、ユーザのリクエストの意図を問い合わせる(ステップS98)。処理部41は、この問い合わせに対するオペレータの回答に基づいて、リクエストのテキスト情報とリクエストの意図とを対応付けた教師データを生成して記憶部32に記憶し(ステップS99)、処理を終了する。 When the manned response is completed (S97: YES), the processing unit 41 inquires of the operator terminal 70 that has performed the manned response the intention of the user's request (step S98). Based on the operator's response to this inquiry, the processing unit 41 generates teacher data in which the text information of the request and the intention of the request are associated with each other and stores it in the storage unit 32 (step S99), and ends the processing.
<意図識別モデルの再学習>
 本実施の形態に係る情報提供システムでは、オペレータによる有人応答の結果により作成された教師データを用いて、管理サーバ30の意図識別モデル81の学習(再学習)処理が行われる。意図識別モデル81の再学習は、例えば1ヶ月に1回等の期間的なタイミング、又は、教師データの蓄積量が所定量を超えたタイミング等に行われる。
<Re-learning of intention discriminative model>
In the information providing system according to the present embodiment, the learning (re-learning) process of the intention identification model 81 of the management server 30 is performed using the teacher data created as a result of the manned response by the operator. The re-learning of the intention identification model 81 is performed at a periodic timing such as once a month, or at a timing when the accumulated amount of teacher data exceeds a predetermined amount.
 図18は、本実施の形態に係る管理サーバ30が行う意図識別モデル81の学習処理の手順を示すフローチャートである。本実施の形態に係る管理サーバ30の処理部31の学習処理部31dは、意図識別モデル81の再学習を行うタイミングに至ったか否かを判定する(ステップS111)。再学習を行うタイミングに至っていない場合(S111:NO)、学習処理部31dは、再学習を行うタイミングに至るまで待機する。 FIG. 18 is a flowchart showing a procedure of learning processing of the intention identification model 81 performed by the management server 30 according to the present embodiment. The learning processing unit 31d of the processing unit 31 of the management server 30 according to the present embodiment determines whether or not the timing for re-learning the intention identification model 81 has been reached (step S111). When the timing for re-learning has not been reached (S111: NO), the learning processing unit 31d waits until the timing for re-learning is reached.
 再学習を行うタイミングに至った場合(S111:YES)、学習処理部31dは、記憶部32に記憶された学習データを読み出す(ステップS112)。また学習処理部31dは、記憶部32に記憶された意図識別モデル81を読み出す。学習処理部31dは、読み出した学習データを用いて意図識別モデル81を学習させる処理を行う(ステップS114)。学習処理部31dは、学習処理を終えた学習済の意図識別モデル81を記憶部32に記憶し(ステップS115)、処理を終了する。 When the timing for re-learning is reached (S111: YES), the learning processing unit 31d reads out the learning data stored in the storage unit 32 (step S112). Further, the learning processing unit 31d reads out the intention identification model 81 stored in the storage unit 32. The learning processing unit 31d performs a process of learning the intention identification model 81 using the read learning data (step S114). The learning processing unit 31d stores the learned intention identification model 81 that has completed the learning processing in the storage unit 32 (step S115), and ends the processing.
<宿泊前の利用>
 本実施の形態に係る情報提供システムでは、宿泊施設に宿泊しているユーザ(宿泊客)のみでなく、宿泊する前のユーザ(例えば予約客)に対しても同様の方法で情報を提供することが可能である。図19は、本実施の形態に係る情報提供システムによる宿泊前の情報提供を説明するための模式図である。例えばユーザは、宿泊施設に対する宿泊の予約を予約サーバ80に対して行うことができる。予約サーバ80は、ユーザによる宿泊の予約が完了した後、宿泊施設の情報を提供する情報提供システムへアクセスするためのアクセス用情報を、予約したユーザへ送信する。
<Use before staying>
In the information providing system according to the present embodiment, information is provided not only to the user (guest) staying at the accommodation facility but also to the user (for example, a reservation guest) before staying in the same way. Is possible. FIG. 19 is a schematic diagram for explaining information provision before accommodation by the information provision system according to the present embodiment. For example, the user can make an accommodation reservation for the accommodation facility to the reservation server 80. After the reservation of the accommodation by the user is completed, the reservation server 80 transmits the access information for accessing the information providing system that provides the information of the accommodation facility to the reserved user.
 予約サーバ80からユーザへのアクセス用情報の送信は、例えばユーザに係る情報として登録された電子メールアドレスに対するメール送信により行うことができる。ユーザへの電子メールには、例えば情報提供システム又は管理サーバ30へアクセスするためのURL及び予約した宿泊施設の識別情報等がアクセス用情報として付され得る。予約サーバ80が送信した電子メールは例えばユーザのスマートフォン40にて受信され、電子メールに付されたアクセス用情報に基づいてユーザはスマートフォン40にて情報提供システムにアクセスする。この場合にユーザは、管理サーバ30が提供するチャットボットの自動応答による情報提供を利用することができる。また例えば、電子メールにはアクセス用情報の二次元コードの画像が添付されていてもよい。ユーザは、二次元コードをスマートフォン40に表示し、自宅に設置された自身のスマートスピーカ20にてカメラ28を利用して二次元コードを読み取り、スマートスピーカ20にて情報提供システムへのアクセスを行ってもよい。この場合にユーザは、管理サーバ30が提供する音声応答による情報提供をスマートスピーカ20にて利用することができる。 The access information from the reservation server 80 to the user can be transmitted, for example, by sending an e-mail to the e-mail address registered as the information related to the user. For example, a URL for accessing the information providing system or the management server 30, identification information of the reserved accommodation facility, and the like may be attached to the e-mail to the user as access information. The e-mail transmitted by the reservation server 80 is received by, for example, the user's smartphone 40, and the user accesses the information providing system on the smartphone 40 based on the access information attached to the e-mail. In this case, the user can use the information provision by the automatic response of the chatbot provided by the management server 30. Further, for example, an image of a two-dimensional code of access information may be attached to the e-mail. The user displays the two-dimensional code on the smartphone 40, reads the two-dimensional code using the camera 28 with his / her own smart speaker 20 installed at home, and accesses the information providing system with the smart speaker 20. You may. In this case, the user can use the information provision by the voice response provided by the management server 30 on the smart speaker 20.
 また例えば、予約サーバ80からユーザへのアクセス用情報の送信は、郵送によるものであってもよい。ユーザへ郵送される宿泊施設の案内等に、アクセス用情報の二次元コードが印刷された手紙又はカード等が同封される。ユーザは、自身のスマートフォン40又はスマートスピーカ20にて二次元コードの読み取りを行い、自身のスマートフォン40又はスマートスピーカ20にて情報提供システムを利用することができる。 Further, for example, the transmission of the access information from the reservation server 80 to the user may be by mail. A letter or card printed with a two-dimensional code of access information is enclosed in the accommodation facility information mailed to the user. The user can read the two-dimensional code with his / her own smartphone 40 or smart speaker 20 and use the information providing system with his / her own smartphone 40 or smart speaker 20.
 情報提供システムの管理サーバ30は、宿泊客であるか予約客であるかに関わらず、ユーザからのリクエストに対して同様の応答を行うことができる。ただし管理サーバ30は、予約客のユーザに対しては応答として与える情報に制限を設けてもよい。また情報提供システムは、宿泊客及び予約客以外のユーザ、例えば宿泊を検討しているユーザ等に対する情報提供を行ってもよい。この場合にユーザは、自身のスマートフォン40又はスマートスピーカ20を利用して宿泊施設のウェブサイト等にアクセスし、問い合わせ等のリクエストを発することで、情報提供システムを利用することができる。 The management server 30 of the information providing system can make a similar response to a request from a user regardless of whether the guest is a guest or a reservation guest. However, the management server 30 may set a limit on the information given as a response to the reserved guest user. Further, the information providing system may provide information to users other than the guest and the reserved guest, for example, a user who is considering staying. In this case, the user can use the information providing system by accessing the website of the accommodation facility or the like using his / her own smartphone 40 or the smart speaker 20 and issuing a request such as an inquiry.
<まとめ>
 以上の構成の本実施の形態に係る情報提供システムは、宿泊施設の客室に設置されたスマートスピーカ20にてユーザからの音声入力に基づくリクエストを取得し、取得したリクエストに対する応答情報をスマートスピーカ20にて音声出力する。取得したリクエストに対する応答情報の出力を行うことができない場合、管理サーバ30は、テキスト入出力による自動応答への切り替えを行う。
<Summary>
In the information providing system according to the present embodiment having the above configuration, the smart speaker 20 installed in the guest room of the accommodation facility acquires a request based on the voice input from the user, and the response information to the acquired request is obtained by the smart speaker 20. Output audio with. When the response information for the acquired request cannot be output, the management server 30 switches to the automatic response by text input / output.
 リクエストに対する応答情報の出力を行うことができない場合とは、ユーザのリクエストに対応する応答情報を応答情報DB82から取得することができず、リクエストに対して適切な応答を行うことができない場合、リクエスト毎に固有の応答を行うことができない場合である。詳細には、例えばユーザが発話した言語が音声処理サーバ50の音声処理の対応言語に含まれていない場合、ユーザの発話内容が音声処理サーバ50により適切にテキスト情報へ変換できない場合、ユーザのリクエストが意図識別モデル81にて意図を識別できない場合、又は、応答情報DB82に対応する応答情報が記憶されていない場合等が含まれる。 When the response information for the request cannot be output, when the response information corresponding to the user's request cannot be obtained from the response information DB 82 and an appropriate response cannot be performed for the request, the request is made. This is the case when it is not possible to make a unique response for each. Specifically, for example, when the language spoken by the user is not included in the language supported by the voice processing server 50, or when the content of the user's utterance cannot be appropriately converted into text information by the voice processing server 50, the user's request. Includes the case where the intention cannot be identified by the intention identification model 81, or the case where the response information corresponding to the response information DB 82 is not stored.
 なお、ユーザのリクエストに対して、例えばリクエストの意図が識別できない場合等に「リクエストに応答できません」等の所定のメッセージを一律に応答することがあり得る。このような応答は、リクエスト毎に固有の応答ではなく、リクエストに対する適切な応答ではない。このような応答を行う場合であっても、本実施の形態においては、リクエストに対する応答情報の出力を行うことができない場合に含まれる。 Note that it is possible that a predetermined message such as "Cannot respond to the request" may be uniformly responded to the user's request, for example, when the intention of the request cannot be identified. Such a response is not a unique response for each request and is not an appropriate response to the request. Even when such a response is performed, in the present embodiment, it is included in the case where the response information for the request cannot be output.
 音声入出力からテキスト入出力へ切り替えを行う場合、管理サーバ30は、テキスト入出力による自動応答へアクセスするためのアクセス用情報20aをスマートスピーカ20のタッチパネル25に表示させる。これによりユーザは、スマートスピーカ20に表示されたアクセス用情報20aを利用して、音声入力からテキスト入出力への切り替えを容易に行うことができる。 When switching from voice input / output to text input / output, the management server 30 displays access information 20a for accessing the automatic response by text input / output on the touch panel 25 of the smart speaker 20. As a result, the user can easily switch from voice input to text input / output by using the access information 20a displayed on the smart speaker 20.
 また本実施の形態に係る管理サーバ30は、スマートスピーカ20が表示したアクセス用情報20aに基づくスマートフォン40からのアクセスを受け付け、スマートフォン40に入力されたテキストに基づくリクエストを取得し、取得したリクエストに対する応答情報をスマートフォン40にテキストで出力させる。これによりユーザは、自身の使い慣れたスマートフォン40を利用してテキスト入力を行い、リクエストに対する応答を取得することができる。 Further, the management server 30 according to the present embodiment receives access from the smartphone 40 based on the access information 20a displayed by the smart speaker 20, acquires a request based on the text input to the smartphone 40, and responds to the acquired request. The response information is output to the smartphone 40 as a text. As a result, the user can input text using his / her familiar smartphone 40 and obtain a response to the request.
 また本実施の形態に係る管理サーバ30は、テキスト情報の入力に基づくリクエストに対して応答情報が出力できない場合に、オペレータによる有人応答への切り替えを行う。これにより自動応答では対応できないユーザのリクエストに対して、オペレータによる確実な応答を行うことができる。 Further, the management server 30 according to the present embodiment switches to a manned response by the operator when the response information cannot be output in response to the request based on the input of the text information. As a result, the operator can reliably respond to the user's request that cannot be handled by the automatic response.
 また本実施の形態に係る管理サーバ30は、日時に応じたオペレータの対応をオペレータDB83に記憶しており、切り替えを行う日時情報に基づいてオペレータDB83を参照し、有人応答の切り替え先となるオペレータを決定する。これにより管理サーバ30は、例えばオペレータセンターの勤務時刻等に応じて、有人応答の切り替え先を適切に決定することができ、ユーザからのリクエストに対して確実に有人応答を行うことができる。 Further, the management server 30 according to the present embodiment stores the operator's response according to the date and time in the operator DB 83, refers to the operator DB 83 based on the date and time information for switching, and is the operator to which the manned response is switched. To determine. As a result, the management server 30 can appropriately determine the switching destination of the manned response according to, for example, the working time of the operator center, and can reliably perform the manned response to the request from the user.
 また本実施の形態に係る管理サーバ30は、ユーザのリクエストに係る情報を入力として受け付け、リクエストの意図を識別して出力するよう学習された意図識別モデル81を用いる。管理サーバ30は、ユーザのリクエストに係る情報を取得して意図識別モデル81へ入力し、意図識別モデル81が出力するリクエストの意図を取得し、取得したリクエストの意図に対する応答情報をスマートスピーカ20又はスマートフォン40に出力させる。ユーザのリクエストに対して応答情報が出力できずにオペレータの有人応答への切り替えを行った場合、管理サーバ30は、この有人応答の結果に基づいて意図識別モデル81の再学習を行う。これにより、意図識別モデル81の識別能力を向上させ、ユーザのリクエストに対する自動応答の精度を高めることができる。 Further, the management server 30 according to the present embodiment uses an intention identification model 81 learned to receive information related to a user's request as input, identify the intention of the request, and output the information. The management server 30 acquires the information related to the user's request and inputs it to the intention identification model 81, acquires the intention of the request output by the intention identification model 81, and inputs the response information to the acquired request intention to the smart speaker 20 or the smart speaker 20. Output to the smartphone 40. When the response information cannot be output in response to the user's request and the operator switches to the manned response, the management server 30 relearns the intention identification model 81 based on the result of the manned response. As a result, the discriminative ability of the intention identification model 81 can be improved, and the accuracy of the automatic response to the user's request can be improved.
 また本実施の形態に係る管理サーバ30は、リクエストに対する応答情報を応答情報DB82に記憶しており、応答情報DB82は宿泊施設毎に設けられる。スマートスピーカ20及びスマートフォン40は、ユーザが入力したリクエストと共に、宿泊施設及び客室を識別する識別情報を送信する。管理サーバ30は、ユーザからのリクエストと共に受信した識別情報に基づいて、宿泊施設に対応する応答情報DB82を利用し、リクエストに対する応答情報を取得する。管理サーバ30は、取得した応答情報をスマートスピーカ20又はスマートフォン40から出力させる。これにより管理サーバ30は、宿泊施設に適した応答情報をユーザに対して提供することができる。 Further, the management server 30 according to the present embodiment stores the response information for the request in the response information DB 82, and the response information DB 82 is provided for each accommodation facility. The smart speaker 20 and the smartphone 40 transmit identification information for identifying the accommodation facility and the guest room together with the request input by the user. The management server 30 uses the response information DB 82 corresponding to the accommodation facility based on the identification information received together with the request from the user, and acquires the response information for the request. The management server 30 outputs the acquired response information from the smart speaker 20 or the smartphone 40. As a result, the management server 30 can provide the user with response information suitable for the accommodation facility.
 また本実施の形態に係る情報提供システムは、宿泊前のユーザに対しても情報提供を行う。例えば予約サーバ80は、宿泊施設に宿泊を予約したユーザのスマートフォン40に対してアクセス用情報を送信する。ユーザはアクセス用情報に基づきスマートフォン40にて情報提供システムへアクセスし、管理サーバ30はアクセス用情報に基づくユーザのスマートフォン40からのアクセスを受け付ける。これにより情報提供システムは、宿泊施設に宿泊しているユーザのみでなく、宿泊前のユーザに対しても情報を提供することができる。ユーザは、宿泊前に宿泊施設に関する種々の情報を取得し、旅行等に備えることができる。 The information providing system according to this embodiment also provides information to the user before staying. For example, the reservation server 80 transmits access information to the smartphone 40 of the user who has reserved accommodation at the accommodation facility. The user accesses the information providing system with the smartphone 40 based on the access information, and the management server 30 accepts the access from the user's smartphone 40 based on the access information. As a result, the information providing system can provide information not only to the users staying at the accommodation facility but also to the users before staying at the accommodation facility. The user can acquire various information about the accommodation facility before staying and prepare for a trip or the like.
 なお本実施の形態においては、宿泊施設に設置されるユーザインタフェースをスマートスピーカ20としたが、これに限るものではない。ユーザインタフェースは、マイク及びスピーカによる音声入出力を行うことが可能な装置であればどのようなものであってもよい。例えば、パーソナルコンピュータにマイク及びスピーカを接続したものをユーザインタフェースとして用いることができる。また本実施の形態においては、ユーザが利用する端末装置をスマートフォン40としたが、これに限るものではない。端末装置は、例えばパーソナルコンピュータ、タブレット型端末装置又は携帯電話機等のように、テキスト入出力が可能な装置であればどのようなものであってもよい。 In the present embodiment, the user interface installed in the accommodation facility is the smart speaker 20, but the present invention is not limited to this. The user interface may be any device as long as it can perform audio input / output by a microphone and a speaker. For example, a personal computer to which a microphone and a speaker are connected can be used as a user interface. Further, in the present embodiment, the terminal device used by the user is the smartphone 40, but the present invention is not limited to this. The terminal device may be any device capable of input / output text, such as a personal computer, a tablet terminal device, or a mobile phone.
 また本実施の形態においては、音声応答からチャットボットの自動応答への切り替え、及び、チャットボットの自動応答からオペレータの有人応答への切り替えを行う前に、ユーザへ切り替えの可否を問い合せる構成としたが、これに限るものではない。管理サーバ30は、ユーザへの問い合わせを行うことなく、これらの切り替えを自動的に行ってもよい。また本実施の形態においては、スマートスピーカ20がアクセス用情報20aを二次元コードとして表示する構成としたが、これに限るものではない。例えばアクセス用情報20aは、例えばURL、ログインID及びパスワード等の情報を含む文字情報としてスマートスピーカ20が表示してもよい。アクセス用情報20aの出力方法はどのようなものであってもよい。 Further, in the present embodiment, before switching from the voice response to the chatbot automatic response and switching from the chatbot automatic response to the operator's manned response, the user is inquired whether or not the switch is possible. However, it is not limited to this. The management server 30 may automatically perform these switching without making an inquiry to the user. Further, in the present embodiment, the smart speaker 20 is configured to display the access information 20a as a two-dimensional code, but the present invention is not limited to this. For example, the access information 20a may be displayed by the smart speaker 20 as character information including information such as a URL, a login ID, and a password. The output method of the access information 20a may be any.
 また本実施の形態においては、管理サーバ30、音声処理サーバ50、翻訳サーバ60及び予約サーバ80の複数のサーバ装置が処理を分担して情報提供システムを実現しているが、これらの処理分担は上述のものに限らない。例えばこれら複数のサーバ装置は1つのサーバ装置として実現されてもよく、また例えば5つ以上のサーバ装置により処理を分担してもよい。例えば管理サーバ30は、音声処理サーバ50又は翻訳サーバ60の処理を行ってもよい。また例えば管理サーバ30は、予約サーバ80の処理を行ってもよい。また意図識別モデル81の保持及び学習等を行うサーバ装置を、管理サーバ30とは別の装置としてもよい。 Further, in the present embodiment, a plurality of server devices of the management server 30, the voice processing server 50, the translation server 60, and the reservation server 80 share the processing to realize the information providing system. Not limited to the above. For example, these plurality of server devices may be realized as one server device, or the processing may be shared by, for example, five or more server devices. For example, the management server 30 may perform processing of the voice processing server 50 or the translation server 60. Further, for example, the management server 30 may perform the processing of the reservation server 80. Further, the server device that holds and learns the intention identification model 81 may be a device different from the management server 30.
 また本実施の形態において管理サーバ30は、意図識別モデル81及び応答情報DB82を用いてリクエストに対する応答を行う構成としたが、これに限るものではない。例えば管理サーバ30は、リクエストのテキスト情報の入力を受け付けて、リクエストに対する応答情報を出力する学習済モデルを用いてもよい。また例えば管理サーバ30は、学習済モデルを用いず、リクエストのテキスト情報と応答情報とが対応付けられたデータベースを用いてもよい。管理サーバ30がリクエストに対して応答を行う方法は、どのようなものであってもよい。 Further, in the present embodiment, the management server 30 is configured to respond to the request by using the intention identification model 81 and the response information DB 82, but the present invention is not limited to this. For example, the management server 30 may use a trained model that accepts the input of the text information of the request and outputs the response information to the request. Further, for example, the management server 30 may use a database in which the text information of the request and the response information are associated with each other without using the trained model. The management server 30 may respond to the request in any way.
<実施の形態2>
 上述の実施の形態に係る情報提供システムにおいては、まずスマートスピーカ20がユーザのリクエストに対する音声応答を行い、適切な音声応答ができないと判断された場合にチャットボットによる応答への切り替えが行われ、チャットボットにより適切な応答ができないと更に判断された場合にオペレータによる有人応答への切り替えが行われる。これに対して実施の形態2に係る情報提供システムでは、スマートスピーカ20が適切な音声応答を行うことができないと判断された場合、チャットボットによる応答又はオペレータによる有人応答のいずれへ切り替えるかをユーザが選択することができる。
<Embodiment 2>
In the information providing system according to the above-described embodiment, the smart speaker 20 first makes a voice response to the user's request, and when it is determined that an appropriate voice response cannot be made, the chatbot switches to the response. If the chatbot further determines that an appropriate response cannot be made, the operator switches to a manned response. On the other hand, in the information providing system according to the second embodiment, when it is determined that the smart speaker 20 cannot perform an appropriate voice response, the user switches between a chatbot response and an operator manned response. Can be selected.
 実施の形態2に係る情報提供システムでは、宿泊施設に宿泊したユーザは、客室に備えられたスマートスピーカ20に対して発話によるリクエストを行うことにより、このリクエストに対する応答の情報をスマートスピーカ20の音声出力により得ることができる。この場合に情報提供システムのスマートスピーカ20、管理サーバ30及び音声処理サーバ50等によって行われる音声応答の処理は、実施の形態1にて説明したものと同じであってよい。 In the information providing system according to the second embodiment, the user staying at the accommodation facility makes a request by utterance to the smart speaker 20 provided in the guest room, and the information of the response to the request is transmitted by the voice of the smart speaker 20. It can be obtained from the output. In this case, the processing of the voice response performed by the smart speaker 20, the management server 30, the voice processing server 50, etc. of the information providing system may be the same as that described in the first embodiment.
 例えばユーザが発話した言語が音声応答の対応言語に含まれていない場合など、適切な音声応答による情報提供を行うことができないと判断した場合、実施の形態2に係る管理サーバ30は、チャットボットによる応答又はオペレータによる有人応答への切り替えをユーザに促す。このときに管理サーバ30は、チャットボットによる応答へ切り替えるための情報と、有人応答へ切り替えるための情報とをスマートスピーカ20の表示部25aに表示させ、ユーザにいずれの応答へ切り替えるかを選択させる。 When it is determined that the information cannot be provided by an appropriate voice response, for example, when the language spoken by the user is not included in the language supported by the voice response, the management server 30 according to the second embodiment is a chatbot. Prompts the user to switch to a response by or a manned response by the operator. At this time, the management server 30 displays the information for switching to the response by the chatbot and the information for switching to the manned response on the display unit 25a of the smart speaker 20, and allows the user to select which response to switch to. ..
 図20は、実施の形態2に係るスマートスピーカ20に表示されるアクセス用情報の一例を示す模式図である。実施の形態2に係る情報提供システムにおいてスマートスピーカ20は、チャットボットでの応答へ切り替えるためのアクセス用情報20aと、コールセンターの有人応答へ切りけるためのアクセス用情報20bとをタッチパネル25に表示する。 FIG. 20 is a schematic diagram showing an example of access information displayed on the smart speaker 20 according to the second embodiment. In the information providing system according to the second embodiment, the smart speaker 20 displays on the touch panel 25 the access information 20a for switching to the response by the chatbot and the access information 20b for switching to the manned response of the call center. ..
 図示の例では、スマートスピーカ20は、略円形のタッチパネル25の上部に「ご希望される応答方法のコードを読み取ってください。」のメッセージを表示し、このメッセージの下方にチャットボット用のアクセス用情報20aと、コールセンター用のアクセス用情報20bとが左右に並べて表示されている。 In the illustrated example, the smart speaker 20 displays the message "Please read the code of the desired response method" at the top of the substantially circular touch panel 25, and below this message is for access for the chatbot. The information 20a and the access information 20b for the call center are displayed side by side.
 なお図20に示したアクセス用情報20a,20bの表示方法は一例であって、これに限るものではない。例えばスマートスピーカ20は、チャットボット又はコールセンターのいずれかの選択を受け付けるボタン又はアイコン等の選択肢をタッチパネル25に表示し、この選択肢に対するタッチ操作に基づいてユーザから応答方法の選択を受け付け、選択され応答方法に対応するアクセス用情報20a,20bのみをタッチパネル25に表示してもよい。また例えばスマートスピーカ20は、アクセス用情報20bに代えて、コールセンターへアクセスするための電話番号等の情報を表示してもよい。 The display method of the access information 20a and 20b shown in FIG. 20 is an example, and is not limited to this. For example, the smart speaker 20 displays options such as a button or an icon that accepts a selection of either a chatbot or a call center on the touch panel 25, accepts a selection of a response method from the user based on a touch operation for the option, and selects and responds. Only the access information 20a and 20b corresponding to the method may be displayed on the touch panel 25. Further, for example, the smart speaker 20 may display information such as a telephone number for accessing the call center instead of the access information 20b.
 ユーザは、スマートスピーカ20に表示された2つのアクセス用情報20a,20bのいずれかをスマートフォン40のカメラで読み取ることで、応答方法を選択することができる。チャットボットによる応答のためのアクセス用情報20aを読み取った場合、スマートフォン40は、アクセス用情報20aからURL等の情報を取得してチャットボットにアクセスし、チャットボットによる応答のサービスをユーザに提供する。チャットボットによる応答については、上述の実施の形態と同様であるため、詳細な説明は省略する。 The user can select the response method by reading either of the two access information 20a and 20b displayed on the smart speaker 20 with the camera of the smartphone 40. When the access information 20a for the response by the chatbot is read, the smartphone 40 acquires information such as a URL from the access information 20a, accesses the chatbot, and provides the user with a response service by the chatbot. .. Since the response by the chatbot is the same as that of the above-described embodiment, detailed description thereof will be omitted.
 コールセンターによる有人応答のためのアクセス用情報20bを読み取った場合、スマートフォン40は、アクセス用情報20bから電話番号又はURL等の情報を取得して、コールセンターへのアクセスを行う(即ちコールセンターへ電話をかける)。実施の形態2においては、スマートフォン40にてコールセンターへのアクセスを行うことによって、コールセンターのオペレータとユーザとがスマートフォン40を介した音声通信(通話)を行うことができる。ただしオペレータ及びユーザが音声通信を行うのではなく、上述の実施の形態にて説明したオペレータ端末70及び管理サーバ30等を介したオペレータとのチャットを行ってもよい。 When the access information 20b for a manned response by the call center is read, the smartphone 40 acquires information such as a telephone number or URL from the access information 20b and accesses the call center (that is, makes a call to the call center). ). In the second embodiment, by accessing the call center with the smartphone 40, the operator and the user of the call center can perform voice communication (call) via the smartphone 40. However, the operator and the user may not perform voice communication, but may chat with the operator via the operator terminal 70, the management server 30, and the like described in the above-described embodiment.
 実施の形態2に係る情報提供システムのコールセンターには、複数の言語に対応するため、各言語に対応するオペレータが複数常駐している。スマートスピーカ20は、ユーザが使用する言語を判定し、この言語に対応したオペレータと音声通信を行うことができるアクセス用情報20bを表示する。なお、ユーザが使用する言語の判定は、スマートスピーカ20が行ってもよく、管理サーバ30が行ってもよく、音声処理サーバ50が行ってもよく、これら以外の装置が行ってもよい。 In the call center of the information providing system according to the second embodiment, a plurality of operators corresponding to each language are resident in order to support a plurality of languages. The smart speaker 20 determines a language used by the user, and displays access information 20b capable of performing voice communication with an operator corresponding to this language. The language used by the user may be determined by the smart speaker 20, the management server 30, the voice processing server 50, or other devices.
 ユーザが使用する言語を判定する方法には、いくつかの方法が採用され得る。以下に言語判定の方法を複数説明するが、情報提供システムはこれら複数の方法のうちの1つ又は複数を用いて言語判定を行ってよい。 Several methods can be adopted as a method for determining the language used by the user. Although a plurality of language determination methods will be described below, the information providing system may perform language determination using one or more of these plurality of methods.
(1)ユーザの発話から判定
 ユーザがスマートスピーカ20に対して発した言葉、単語又は文章等の内容に基づいて、ユーザが使用する言語を判定する。ユーザの発話に係る音声情報はスマートスピーカ20から音声処理サーバ50へ送信され、音声処理サーバ50は処理対象の言語について音声情報をテキスト情報に変換することができる。このため音声処理サーバ50は、与えられた音声情報が処理対象の言語であるか否か、複数の言語に対応していれば音声情報が処理対象の言語のいずれであるかを判定することができる。音声情報が処理対象の言語でない場合、例えば音声処理サーバ50がテキスト情報への変換は行わずに言語の判定だけを行ってもよく、また例えば音声処理サーバ50が処理できない音声情報を管理サーバ30へ送信し、管理サーバ30が音声情報から言語の判定を行ってもよく、また例えばスマートスピーカ20が音声情報から言語の判定を行ってもよい。
(1) Judgment from the user's utterance The language used by the user is determined based on the content of words, words, sentences, etc. that the user has uttered to the smart speaker 20. The voice information related to the user's utterance is transmitted from the smart speaker 20 to the voice processing server 50, and the voice processing server 50 can convert the voice information into text information for the language to be processed. Therefore, the voice processing server 50 can determine whether or not the given voice information is the language to be processed, and if the voice information corresponds to a plurality of languages, which of the languages to be processed is the voice information. it can. When the voice information is not the language to be processed, for example, the voice processing server 50 may only determine the language without converting it into text information, and for example, the management server 30 may handle voice information that the voice processing server 50 cannot process. The management server 30 may determine the language from the voice information, or the smart speaker 20 may determine the language from the voice information, for example.
 音声情報から言語を判定するには、例えば音声情報とその言語種別とが対応付けられた教師データを用いて機械学習がなされた学習モデルが用いられ得る。この学習モデルは、音声処理サーバ50の処理対象の言語よりも多くの言語について学習がなされ、音声情報を入力として受け付けて、この音声情報の言語の種別に関する情報を出力する。音声処理サーバ50、管理サーバ30又はスマートスピーカ20は、この学習モデルを予め記憶しており、音声処理サーバ50の処理対象ではない言語について、音声情報に基づく言語の判定を行うことができる。 In order to determine a language from voice information, for example, a learning model in which machine learning is performed using teacher data in which voice information and its language type are associated can be used. This learning model learns more languages than the language to be processed by the voice processing server 50, accepts voice information as input, and outputs information regarding the language type of the voice information. The voice processing server 50, the management server 30, or the smart speaker 20 stores this learning model in advance, and can determine a language based on voice information for a language that is not a processing target of the voice processing server 50.
(2)宿泊者情報から判定
 管理サーバ30又は宿泊施設のコンピュータ等が宿泊者に関する情報を管理するデータベース等を備える場合、このデータベースに記憶された情報を基に、ユーザが使用する言語を判定することができる。例えばユーザの宿泊者情報として、ユーザの住所又は出身地等の国又は場所等に関する情報がデータベースに記憶されている場合、この国又は場所等に対応する言語をユーザが使用する言語と判定することができる。
(2) Judgment from guest information When the management server 30 or the computer of the accommodation facility has a database or the like that manages information about the guest, the language used by the user is determined based on the information stored in this database. be able to. For example, when information about a country or place such as a user's address or birthplace is stored in a database as a user's guest information, the language corresponding to this country or place is determined to be the language used by the user. Can be done.
 なお、宿泊者情報の管理には、PMS(Property Management System)と呼ばれるホテル管理システムが利用され得る。スマートスピーカ20又は管理サーバ30は、PMSのサーバとの間で通信を行い、宿泊しているユーザに関する情報を取得して、このユーザが使用する言語を判定することができる。 A hotel management system called PMS (Property Management System) can be used to manage guest information. The smart speaker 20 or the management server 30 can communicate with the PMS server, acquire information about the staying user, and determine the language used by this user.
(3)スマートフォンの情報から判定
 ユーザが所持しているスマートフォン40に設定されている言語に関する情報を取得し、取得した情報を基にユーザが使用する言語を判定することができる。例えばスマートスピーカ20は、NFC(Near Field Communication)、Bluetooth(登録商標)又は無線LAN等の無線通信を利用してスマートフォン40との間で情報の授受を行うことができる。スマートフォン40にはユーザが使用する言語に関する設定が予めなされており、スマートスピーカ20はスマートフォン40からこの言語設定の情報を無線通信により取得して、ユーザが使用する言語を判定する。
(3) Judgment from smartphone information Information on the language set in the smartphone 40 possessed by the user can be acquired, and the language used by the user can be determined based on the acquired information. For example, the smart speaker 20 can exchange information with and from the smartphone 40 by using wireless communication such as NFC (Near Field Communication), Bluetooth (registered trademark), or wireless LAN. The smartphone 40 is set in advance regarding the language used by the user, and the smart speaker 20 acquires the information of the language setting from the smartphone 40 by wireless communication to determine the language used by the user.
 なおスマートフォン40の設定情報を利用する場合、スマートスピーカ20が言語を判定するのではなく、スマートフォン40が判定を行ってもよい。即ち、スマートスピーカ20に表示されたアクセス用情報20bを読み取ったスマートフォン40は、アクセス用情報20bに基づいてコールセンターへのアクセスを行う際に、自身に設定された言語に応じてアクセス先を決定してもよい。 When using the setting information of the smartphone 40, the smartphone 40 may make the determination instead of the smart speaker 20 determining the language. That is, the smartphone 40 that has read the access information 20b displayed on the smart speaker 20 determines the access destination according to the language set for itself when accessing the call center based on the access information 20b. You may.
(4)ユーザによる選択
 スマートスピーカ20は、ユーザから言語の選択操作を受け付けて、ユーザが使用する言語を判定することができる。スマートスピーカ20は、例えば図4に示したスマートフォン40の言語選択ボックスと同様のものをタッチパネル25に表示し、ユーザによる言語選択をタッチパネル25にて受け付ける。なおスマートスピーカ20による言語選択の受け付けは、適宜のタイミングで行われてよい。スマートスピーカ20は、例えばユーザが宿泊施設を訪れて最初にスマートスピーカ20を使用するタイミングで言語選択の受け付けを行ってよく、また例えばコールセンターへのアクセスを行うためのアクセス用情報20bを表示する直前のタイミングで言語選択を受け付けてもよい。
(4) Selection by User The smart speaker 20 can receive a language selection operation from the user and determine the language used by the user. For example, the smart speaker 20 displays the same language selection box of the smartphone 40 shown in FIG. 4 on the touch panel 25, and accepts the language selection by the user on the touch panel 25. The smart speaker 20 may accept the language selection at an appropriate timing. The smart speaker 20 may accept language selection at the timing when the user first visits the accommodation facility and uses the smart speaker 20, for example, immediately before displaying the access information 20b for accessing the call center. Language selection may be accepted at the timing of.
 実施の形態2に係るスマートスピーカ20、管理サーバ30、スマートフォン40又は音声処理サーバ50等の装置は、上記の(1)~(4)のいずれか1つを採用して又は複数を適宜に組み合わせて、ユーザが使用する言語を判定する。スマートスピーカ20は、使用言語の判定結果に応じたアクセス用情報20bをタッチパネル25に表示する。このアクセス用情報20bをスマートフォン40にて読み取ったユーザは、アクセス用情報20bを利用してコールセンターにアクセスすることによって、自身が使用する言語でオペレータとの音声通信を行うことができる。 As the device such as the smart speaker 20, the management server 30, the smartphone 40, or the voice processing server 50 according to the second embodiment, any one of the above (1) to (4) is adopted or a plurality of devices are appropriately combined. To determine the language used by the user. The smart speaker 20 displays the access information 20b according to the determination result of the language used on the touch panel 25. A user who has read the access information 20b with the smartphone 40 can perform voice communication with the operator in the language used by the user by accessing the call center using the access information 20b.
 なお実施の形態2に係る情報提供システムでは、スマートスピーカ20、管理サーバ30、スマートフォン40又は音声処理サーバ50等の装置が、アクセス用情報20bを表示する前に、ユーザが使用する言語を判定する構成であるが、これに限るものではない。例えば、スマートスピーカ20が表示したアクセス用情報20bをスマートフォン40にて読み取り、ユーザがアクセス用情報20bを利用してスマートフォン40でコールセンターへアクセスを行った後に、ユーザが使用する言語を判定してもよい。この場合、例えばコールセンターを管理するサーバ等の装置が、上記の(1)~(4)のいずれか1つを採用して又は複数を適宜に組み合わせてユーザの使用する言語を判定し、判定結果に応じたオペレータとユーザとの間で音声通信を行わせる。 In the information providing system according to the second embodiment, the device such as the smart speaker 20, the management server 30, the smartphone 40, or the voice processing server 50 determines the language used by the user before displaying the access information 20b. It is a configuration, but it is not limited to this. For example, even if the access information 20b displayed by the smart speaker 20 is read by the smartphone 40 and the user accesses the call center with the smartphone 40 using the access information 20b, the language used by the user is determined. Good. In this case, for example, a device such as a server that manages a call center determines the language used by the user by adopting any one of (1) to (4) above or by appropriately combining a plurality of the above (1) to (4), and the determination result is obtained. Voice communication is performed between the operator and the user according to the above.
 またスマートスピーカ20が表示するアクセス用情報20bは、例えば宿泊施設毎、宿泊施設の部屋毎又は宿泊施設の所在地毎等により異なる情報が表示されてもよい。これによりコールセンターでは、スマートフォン40が掛けた電話番号に応じて、いずれの宿泊施設等からの問い合わせであるかを判断することができる。またアクセス用情報20bは、例えば時間又は曜日等に応じて異なる情報が表示されてもよく、また例えばコールセンターが営業しているか否か又はオペレータが在席しているか否か等に応じて異なる情報が表示されてもよい。 Further, the access information 20b displayed by the smart speaker 20 may display different information depending on, for example, each accommodation facility, each room of the accommodation facility, each location of the accommodation facility, and the like. As a result, the call center can determine which accommodation facility or the like is inquiring according to the telephone number dialed by the smartphone 40. Further, the access information 20b may display different information depending on, for example, the time or day of the week, and different information depending on, for example, whether or not the call center is open or whether or not the operator is present. May be displayed.
 図21は、実施の形態2に係るスマートスピーカ20が行う処理の手順を示すフローチャートである。実施の形態2に係るスマートスピーカ20の処理部21は、ユーザからのリクエストに関する音声情報を音声処理サーバ50へ送信すると共に、音声処理サーバ50から受信した音声情報をスピーカ27から出力する音声応答処理を行う(ステップS201)。 FIG. 21 is a flowchart showing a procedure of processing performed by the smart speaker 20 according to the second embodiment. The processing unit 21 of the smart speaker 20 according to the second embodiment transmits voice information related to a request from a user to the voice processing server 50, and outputs voice information received from the voice processing server 50 from the speaker 27. (Step S201).
 処理部21は、音声応答処理を行っている際に、音声応答からチャットボットによる応答又はコールセンターによる応答への切り替えを指示する切替指示が音声処理サーバ50から与えられたか否かを判定する(ステップS202)。切替指示が与えられていない場合(S202:NO)、処理部21は、ステップS201へ処理を戻し、音声応答処理を継続する。 The processing unit 21 determines whether or not a switching instruction for instructing switching from the voice response to the response by the chatbot or the response by the call center is given from the voice processing server 50 during the voice response processing (step). S202). When the switching instruction is not given (S202: NO), the processing unit 21 returns the processing to step S201 and continues the voice response processing.
 切替指示が与えられた場合(S202:YES)、切り替えの可否を問い合わせるメッセージ等をスピーカ27から音声出力するか又はタッチパネル25に表示することにより、切り替えの可否をユーザに問い合わせる(ステップS203)。処理部21は、この問い合わせに対して切り替えを肯定する回答が得られたか否かを判定する(ステップS204)。切り替えを肯定する回答が得られなかった場合(S204:NO)、処理部21は、ステップS201へ処理を戻し、音声応答処理を継続する。 When a switching instruction is given (S202: YES), the user is inquired about whether or not switching is possible by outputting a voice message or the like inquiring whether or not switching is possible from the speaker 27 or displaying it on the touch panel 25 (step S203). The processing unit 21 determines whether or not an answer affirming the switching has been obtained in response to this inquiry (step S204). When an answer affirming the switching is not obtained (S204: NO), the processing unit 21 returns the processing to step S201 and continues the voice response processing.
 切り替えを肯定する回答が得られた場合(S204:YES)、処理部21は、ユーザが使用する言語を判定する(ステップS205)。なお言語判定は、これ以前のタイミングで行われてもよい。またスマートスピーカ20以外の装置で言語判定を行ってもよく、この場合にはスマートスピーカ20の処理部21は、言語判定を行った装置から直接的又は間接的に判定結果を取得する。 When an answer affirming the switching is obtained (S204: YES), the processing unit 21 determines the language used by the user (step S205). The language determination may be performed at a timing earlier than this. Further, the language determination may be performed by a device other than the smart speaker 20, and in this case, the processing unit 21 of the smart speaker 20 directly or indirectly acquires the determination result from the device that has performed the language determination.
 次いで処理部21は、図11に示すように、チャットボットによる応答への切り替えを行うためのアクセス用情報20aと、コールセンターによる有人応答への切り替えを行うためのアクセス用情報20bとを、タッチパネル25に表示して(ステップS206)、処理を終了する。 Next, as shown in FIG. 11, the processing unit 21 displays the access information 20a for switching to the response by the chatbot and the access information 20b for switching to the manned response by the call center on the touch panel 25. Is displayed (step S206), and the process ends.
 図22は、実施の形態2に係るスマートフォン40が行う音声通信開始処理の手順を示すフローチャートである。実施の形態2に係るスマートフォン40の処理部41は、例えばユーザの操作に応じてアプリケーションプログラムを起動し、二次元コードの読み取りを行うモードへと移行する(ステップS211)。本モードにおいて処理部41のアクセス用情報取得部41aは、スマートスピーカ20が表示したアクセス用情報20bをカメラ45にて撮像することにより、コールセンターによる有人応答への切り替えを行うためのアクセス用情報20bを取得する(ステップS212)。アクセス用情報取得部41aは、取得したアクセス用情報20bに含まれる電話番号等の情報を取得する(ステップS213)。 FIG. 22 is a flowchart showing a procedure of voice communication start processing performed by the smartphone 40 according to the second embodiment. The processing unit 41 of the smartphone 40 according to the second embodiment starts an application program in response to, for example, a user operation, and shifts to a mode for reading a two-dimensional code (step S211). In this mode, the access information acquisition unit 41a of the processing unit 41 captures the access information 20b displayed by the smart speaker 20 with the camera 45, so that the access information 20b for switching to the manned response by the call center is performed. (Step S212). The access information acquisition unit 41a acquires information such as a telephone number included in the acquired access information 20b (step S213).
 次いで処理部41は、取得した電話番号等の情報に基づいて、コールセンターへのアクセスを行う(ステップS214)。処理部41は、ユーザ及びコールセンターの間での音声通信を開始し(ステップS215)、処理を終了する。 Next, the processing unit 41 accesses the call center based on the acquired information such as the telephone number (step S214). The processing unit 41 starts voice communication between the user and the call center (step S215), and ends the processing.
 以上の構成の実施の形態2に係る情報提供システムは、宿泊施設の客室に設置されたスマートスピーカ20にてユーザからの音声入力に基づくリクエストを取得し、取得したリクエストに対する応答情報をスマートスピーカ20にて音声出力する。取得したリクエストに対する応答情報の出力を行うことができない場合、管理サーバ30及びスマートスピーカ20は、テキスト入出力による自動応答、又は、オペレータによる有人応答のいずれかへの切り替えを行う。 In the information providing system according to the second embodiment of the above configuration, the smart speaker 20 installed in the guest room of the accommodation facility acquires a request based on the voice input from the user, and the response information to the acquired request is obtained by the smart speaker 20. Output audio with. When the response information for the acquired request cannot be output, the management server 30 and the smart speaker 20 switch to either an automatic response by text input / output or a manned response by the operator.
 このときにスマートスピーカ20は、テキスト入出力による自動応答へ切り替えるためのアクセス用情報20aと、オペレータによる有人応答へ切り替えるためのアクセス用情報20bとを表示し、いずれの切り替えを行うかをユーザに選択させる。ユーザは、自身のスマートフォン40にていずれかのアクセス用情報20a,20bを読み取ることで、いずれの応答方法に切り替えるかを選択することができる。 At this time, the smart speaker 20 displays the access information 20a for switching to the automatic response by text input / output and the access information 20b for switching to the manned response by the operator, and tells the user which switching is to be performed. Let me choose. The user can select which response method to switch to by reading any of the access information 20a or 20b with his / her smartphone 40.
 これによりユーザは、スマートスピーカ20を利用した音声応答で所望の結果が得られない場合等に、チャットボットによる応答又はオペレータによる有人応答のいずれかを選択して切り替えることができる。ユーザが自身の好み等に応じた応答方法への切り替えを行うことができるため、ユーザに適した情報提供を行うことが期待できる。 As a result, the user can select and switch between the response by the chatbot and the manned response by the operator when the desired result cannot be obtained by the voice response using the smart speaker 20. Since the user can switch to the response method according to his / her own preference, it can be expected to provide information suitable for the user.
 また実施の形態2に係る情報提供システムは、ユーザが使用する言語の種別を判定し、判定した言語で対応できるオペレータによる有人応答への切り替えを行う。これによりユーザは、オペレータとの音声通信等を円滑に行うことができる。 Further, the information providing system according to the second embodiment determines the type of language used by the user, and switches to a manned response by an operator who can respond in the determined language. As a result, the user can smoothly perform voice communication with the operator.
 なお、実施の形態2に係る情報提供システムでは、スマートスピーカ20が表示したアクセス用情報20bをスマートフォン40にて読み取り、ユーザがスマートフォン40を用いてコールセンターのオペレータとの音声通信を行う構成であるが、これに限るものではない。例えばコールセンターのオペレータによる有人応答への切り替えがユーザにより選択された場合に、スマートスピーカ20がコールセンターへのアクセスを行い、スマートスピーカ20がユーザ及びオペレータによる音声通信を行ってもよい。 The information providing system according to the second embodiment has a configuration in which the access information 20b displayed by the smart speaker 20 is read by the smartphone 40, and the user performs voice communication with the call center operator using the smartphone 40. , Not limited to this. For example, when the switch to the manned response by the call center operator is selected by the user, the smart speaker 20 may access the call center and the smart speaker 20 may perform voice communication by the user and the operator.
 また実施の形態2に係る情報提供システムでは、オペレータ端末70を利用したチャットによるオペレータの有人応答の昨日は備えていなくてもよい。 Further, the information providing system according to the second embodiment does not have to be provided yesterday for the operator's manned response by chat using the operator terminal 70.
 また、実施の形態2に係る情報提供システムのその他の構成は、先の実施の形態に係る情報提供システムと同様であるため、同様の箇所には同じ符号を付し、詳細な説明を省略する。 Further, since the other configurations of the information providing system according to the second embodiment are the same as those of the information providing system according to the previous embodiment, the same reference numerals are given to the same parts, and detailed description thereof will be omitted. ..
<実施の形態3>
 上述の実施の形態2に係る情報提供システムは、音声によるリクエストに対する応答情報の出力を行うことができない場合に、テキスト入出力による自動応答、又は、オペレータによる有人応答のいずれかへの切り替えを行っている。これに対して実施の形態3に係る情報提供システムは、音声によるリクエストに対する応答情報の出力を行うことができない場合に、まず、テキスト入出力による自動応答への切り替えを行う。これは実施の形態1に係る情報提供システムと同様の手順で行うことができる。
<Embodiment 3>
The information providing system according to the second embodiment described above switches to either an automatic response by text input / output or a manned response by an operator when it is not possible to output response information to a request by voice. ing. On the other hand, the information providing system according to the third embodiment first switches to the automatic response by text input / output when the response information for the request by voice cannot be output. This can be performed in the same procedure as the information providing system according to the first embodiment.
 上述の実施の形態1に係る情報提供システムは、テキスト入出力による自動応答でユーザのリクエストに応答できない場合、テキスト入出力によるオペレータの有人応答への切り替えを行っている。これに対して実施の形態3に係る情報提供システムは、テキスト入出力による自動応答でユーザのリクエストに応答できない場合には、実施の形態1に係る情報提供システムと同様のテキスト入出力による有人応答へ切り替えるか、又は、コールセンターのオペレータによる音声での有人応答へ切り替えるかの選択をユーザから受け付ける。 The information providing system according to the first embodiment described above switches to the operator's manned response by text input / output when the automatic response by text input / output cannot respond to the user's request. On the other hand, when the information providing system according to the third embodiment cannot respond to the user's request by the automatic response by the text input / output, the information providing system according to the first embodiment has the same manned response by the text input / output. The user accepts the option of switching to or switching to a voice manned response by the call center operator.
 実施の形態3に係る情報提供システムの管理サーバ30は、ユーザからのリクエストの意図が識別不可能であった場合、又は、リクエストの意図に対する応答情報が得られなかった場合、チャットボットの自動応答からオペレータによる有人応答への切替指示をユーザのスマートフォン40へ送信する。これは、例えば図16に示したフローチャートのステップS78に示された処理に相当する。 The management server 30 of the information providing system according to the third embodiment automatically responds to the chatbot when the intention of the request from the user cannot be identified or when the response information for the intention of the request cannot be obtained. Sends the operator's instruction to switch to the manned response to the user's smartphone 40. This corresponds to, for example, the process shown in step S78 of the flowchart shown in FIG.
 実施の形態3に係る情報提供システムのスマートフォン40は、テキスト入出力によるチャットボットの自動応答を行っている際に、管理サーバ30から切替指示を受信したか否かの判定を行う。これは、例えば図15に示したフローチャートのステップS54に示された処理に相当する。 The smartphone 40 of the information providing system according to the third embodiment determines whether or not a switching instruction has been received from the management server 30 while performing an automatic response of the chatbot by text input / output. This corresponds to, for example, the process shown in step S54 of the flowchart shown in FIG.
 管理サーバ30から切替指示を受信した場合、実施の形態3に係るスマートフォン40は、オペレータのテキスト入出力による有人応答へ切り替えるか、又は、コールセンターのオペレータの音声入出力による有人応答へ切り替えるかの選択を受け付ける。このときにスマートフォン40は、タッチパネル44に例えば有人応答へ切り替える旨のメッセージを表示すると共に、「チャット(テキスト入出力)」のラベルが付されたボタンと、「コールセンター(音声入出力)」のラベルが付されたボタンとを並べて表示する。スマートフォン40は、いずれかのボタンに対するタッチ操作を受け付けることにより、テキスト入出力による有人応答又は音声入出力による有人応答の選択を受け付ける。 When the switching instruction is received from the management server 30, the smartphone 40 according to the third embodiment selects whether to switch to the manned response by the operator's text input / output or to the manned response by the call center operator's voice input / output. Accept. At this time, the smartphone 40 displays a message on the touch panel 44 to switch to a manned response, for example, a button labeled "chat (text input / output)", and a label "call center (voice input / output)". The buttons marked with are displayed side by side. By accepting a touch operation on any of the buttons, the smartphone 40 accepts the selection of a manned response by text input / output or a manned response by voice input / output.
 テキスト入出力による有人応答への切り替えが選択された場合、スマートフォン40は、テキスト入出力による有人応答への切り替えの要求を管理サーバ30へ送信する。これは、例えば図15に示したフローチャートのステップS58に示された処理に相当する。スマートフォン40からの要求を受信した管理サーバ30は、例えば図17に示したフローチャートの処理により、テキスト入出力による有人応答を行う。 When switching to a manned response by text input / output is selected, the smartphone 40 transmits a request for switching to a manned response by text input / output to the management server 30. This corresponds to, for example, the process shown in step S58 of the flowchart shown in FIG. The management server 30 that has received the request from the smartphone 40 performs a manned response by text input / output, for example, by processing the flowchart shown in FIG.
 音声入出力による有人応答への切り替えが選択された場合、実施の形態3に係るスマートフォン40は、音声入出力による有人応答への切り替えの要求を管理サーバ30へ送信する。この要求を受信した管理サーバ30は、例えばユーザが使用する言語を判定する処理を行って、ユーザに適したコールセンター又はオペレータ等を決定する。管理サーバ30は、決定したオペレータとの通話を行うための電話番号等の情報をスマートフォン40へ送信するか、又は、このオペレータとスマートフォン40との通話接続を確立する処理を行う。スマートフォン40は、管理サーバ30から電話番号等の情報を受信した場合には、受信した情報に基づいてコールセンターへアクセスし、オペレータとの音声通信を開始する。 When switching to the manned response by voice input / output is selected, the smartphone 40 according to the third embodiment transmits a request for switching to the manned response by voice input / output to the management server 30. Upon receiving this request, the management server 30 performs, for example, a process of determining the language used by the user to determine a call center, an operator, or the like suitable for the user. The management server 30 transmits information such as a telephone number for making a call with the determined operator to the smartphone 40, or performs a process of establishing a call connection between the operator and the smartphone 40. When the smartphone 40 receives information such as a telephone number from the management server 30, it accesses the call center based on the received information and starts voice communication with the operator.
 以上の構成の実施の形態3に係る情報提供システムは、テキスト入出力による自動応答ではユーザのリクエストに応答できない場合に、テキスト入出力による有人応答又は音声入出力による有人応答への切り替えの選択をユーザから受け付けて、選択された有人応答への切り替えを行う。 The information providing system according to the third embodiment of the above configuration selects switching to a manned response by text input / output or a manned response by voice input / output when the automatic response by text input / output cannot respond to the user's request. Accepts from the user and switches to the selected manned response.
 これによりユーザは、チャットボットによる自動応答で所望の結果が得られない場合等に、オペレータによる有人応答への切り替えを行うことができ、テキスト入出力又は音声入出力のいずれの有人応答へ切り替えを行うかを選択することができる。ユーザが自身の好み等に応じた応答方法への切り替えを行うことができるため、ユーザに適した情報提供を行うことが期待できる。 As a result, the user can switch to the manned response by the operator when the desired result cannot be obtained by the automatic response by the chatbot, and can switch to either the text input / output or the voice input / output manned response. You can choose whether to do it. Since the user can switch to the response method according to his / her own preference, it can be expected to provide information suitable for the user.
 なお実施の形態3に係る情報提供システムのその他の構成は、先の実施の形態に係る情報提供システムと同様であるため、同様の箇所には同じ符号を付し、詳細な説明を省略する。 Since the other configurations of the information providing system according to the third embodiment are the same as those of the information providing system according to the previous embodiment, the same reference numerals are given to the same parts, and detailed description thereof will be omitted.
 今回開示された実施形態はすべての点で例示であって、制限的なものではないと考えられるべきである。本開示の範囲は、上記した意味ではなく、請求の範囲によって示され、請求の範囲と均等の意味及び範囲内でのすべての変更が含まれることが意図される。 The embodiments disclosed this time should be considered to be exemplary in all respects and not restrictive. The scope of the present disclosure is indicated by the scope of claims, not the above-mentioned meaning, and is intended to include all modifications within the meaning and scope equivalent to the scope of claims.
 10 情報提供システム
 20 スマートスピーカ(ユーザインタフェース)
 20a アクセス用情報
 21 処理部
 22 記憶部
 22a プログラム
 22b 識別情報
 23 通信部
 25 タッチパネル
 25a 表示部
 25b 入力部
 26 マイク
 27 スピーカ
 28 カメラ
 30 管理サーバ(情報提供装置)
 31 処理部
 31a リクエスト取得部
 31b 応答処理部
 31c 切替処理部
 31d 学習処理部
 32 記憶部
 32a サーバプログラム
 33 通信部
 40 スマートフォン(端末装置)
 41 処理部
 41a アクセス用情報取得部
 41b チャット処理部
 42 記憶部
 42a プログラム
 43 通信部
 44 タッチパネル
 45 カメラ
 50 音声処理サーバ
 60 翻訳サーバ
 70 オペレータ端末
 80 予約サーバ
 81 意図識別モデル(識別器)
 82 応答情報DB
 83 オペレータDB
 99 記録媒体
 
10 Information provision system 20 Smart speaker (user interface)
20a Access information 21 Processing unit 22 Storage unit 22a Program 22b Identification information 23 Communication unit 25 Touch panel 25a Display unit 25b Input unit 26 Microphone 27 Speaker 28 Camera 30 Management server (information providing device)
31 Processing unit 31a Request acquisition unit 31b Response processing unit 31c Switching processing unit 31d Learning processing unit 32 Storage unit 32a Server program 33 Communication unit 40 Smartphone (terminal device)
41 Processing unit 41a Access information acquisition unit 41b Chat processing unit 42 Storage unit 42a Program 43 Communication unit 44 Touch panel 45 Camera 50 Voice processing server 60 Translation server 70 Operator terminal 80 Reservation server 81 Intention identification model (identifier)
82 Response information DB
83 Operator DB
99 Recording medium

Claims (17)

  1.  宿泊施設に設置されたユーザインタフェースに入力された音声に基づくリクエストを取得し、
     取得したリクエストに対応する応答情報をデータベースから取得し、
     取得した応答情報を前記ユーザインタフェースに音声で出力させ、
     前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答へ切り替える、情報提供方法。
    Get a request based on the voice input to the user interface installed in the accommodation,
    Obtain the response information corresponding to the obtained request from the database and
    The acquired response information is output to the user interface by voice.
    An information providing method for switching to an automatic response by inputting / outputting text when the response information cannot be obtained from the database.
  2.  前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答へ切り替えるためのアクセス用情報を前記ユーザインタフェースに出力させる、請求項1に記載の情報提供方法。 The information providing method according to claim 1, wherein when the response information cannot be acquired from the database, access information for switching to an automatic response by input / output of text is output to the user interface.
  3.  前記アクセス用情報に基づく端末装置からのアクセスを受け付け、
     前記端末装置に入力されたテキストに基づくリクエストを取得し、
     取得したリクエストに対応する応答情報を前記データベースから取得し、
     取得した前記応答情報を前記端末装置にテキストで出力させる、
     請求項2に記載の情報提供方法。
    Accepting access from the terminal device based on the access information,
    Acquire the request based on the text input to the terminal device, and
    The response information corresponding to the acquired request is acquired from the database, and the response information is acquired.
    The acquired response information is output to the terminal device as a text.
    The information providing method according to claim 2.
  4.  前記データベースからテキストに基づく前記リクエストに対応する応答情報が取得できない場合に、オペレータによる有人応答へ切り替える、請求項3に記載の情報提供方法。 The information providing method according to claim 3, wherein when the response information corresponding to the request based on the text cannot be obtained from the database, the response is switched to the manned response by the operator.
  5.  テキスト入出力による有人応答又は音声入出力による有人応答の選択を受け付け、
     受け付けた選択に応じたオペレータによる有人応答へ切り替える、請求項4に記載の情報提供方法。
    Accepts the selection of manned response by text input / output or manned response by voice input / output,
    The information providing method according to claim 4, wherein the response is switched to a manned response by an operator according to the accepted selection.
  6.  日付、時刻又は言語の情報に基づいて、前記有人応答への切り替え先を決定する、請求項4又は請求項5に記載の情報提供方法。 The information providing method according to claim 4 or 5, wherein the switching destination to the manned response is determined based on the date, time, or language information.
  7.  ユーザのリクエストに係る情報を入力として受け付け、前記ユーザのリクエストの意図を識別して出力するよう学習された識別器を用い、
     前記識別器にユーザのリクエストに係る情報を入力して、前記識別器が出力する前記リクエストの意図を取得し、
     取得したリクエストの意図に対応する応答情報を前記データベースから取得し、
     取得した応答情報を出力させ、
     前記オペレータによる有人応答への切り替えを行った場合に、前記有人応答の結果に基づいて前記識別器を再学習する、
     請求項4から請求項6までのいずれか1つに記載の情報提供方法。
    Using a classifier learned to accept information related to a user's request as input and identify and output the intention of the user's request,
    By inputting the information related to the user's request into the classifier, the intent of the request output by the classifier is acquired.
    The response information corresponding to the intent of the acquired request is acquired from the database, and the response information is acquired.
    Output the acquired response information
    When the operator switches to the manned response, the classifier is relearned based on the result of the manned response.
    The information providing method according to any one of claims 4 to 6.
  8.  前記宿泊施設を識別する識別情報を取得し、
     取得したリクエスト及び識別情報に対応する応答情報を出力する、
     請求項2から請求項7までのいずれか1つに記載の情報提供方法。
    Acquire the identification information that identifies the accommodation facility,
    Output the response information corresponding to the acquired request and identification information,
    The information providing method according to any one of claims 2 to 7.
  9.  前記データベースは、前記宿泊施設毎の応答情報を記憶し、
     前記識別情報に基づいて、前記宿泊施設に対応する応答情報を前記データベースから取得する、
     請求項8に記載の情報提供方法。
    The database stores response information for each accommodation facility, and stores the response information.
    Based on the identification information, the response information corresponding to the accommodation facility is acquired from the database.
    The information providing method according to claim 8.
  10.  前記宿泊施設に宿泊前のユーザに対して前記アクセス用情報を送信し、
     前記アクセス用情報に基づく前記ユーザの端末装置からのアクセスを受け付ける、
     請求項2から請求項9までのいずれか1つに記載の情報提供方法。
    The access information is transmitted to the user before staying at the accommodation facility, and the access information is transmitted to the accommodation facility.
    Accepting access from the user's terminal device based on the access information,
    The information providing method according to any one of claims 2 to 9.
  11.  前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答への切り替えを行うための情報、及び、オペレータによる有人応答への切り替えを行うための情報を選択可能に表示する、
     請求項1から請求項10までのいずれか1つに記載の情報提供方法。
    When the response information cannot be obtained from the database, the information for switching to the automatic response by input / output of text and the information for switching to the manned response by the operator are displayed in a selectable manner.
    The information providing method according to any one of claims 1 to 10.
  12.  前記有人応答のいずれかへの切り替えを行うための情報が選択された場合に、
     ユーザが使用する言語を判定し、
     判定した言語に対応したオペレータによる有人応答への切り替えを行う、
     請求項11に記載の情報提供方法。
    When information is selected to switch to any of the manned responses,
    Determine the language used by the user
    Switch to manned response by the operator corresponding to the judged language,
    The information providing method according to claim 11.
  13.  宿泊施設に設置されるユーザインタフェースと、
     前記ユーザインタフェースに入力された音声に基づくリクエストを取得するリクエスト取得部、前記リクエスト取得部が取得したリクエストに対応する応答情報をデータベースから取得する応答情報取得部、前記応答情報取得部が取得した応答情報を前記ユーザインタフェースに音声で出力させる処理を行う音声出力処理部、及び、前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答へ切り替えに係る処理を行う処理部を有する情報提供装置と
     を備える、情報提供システム。
    The user interface installed in the accommodation and
    A request acquisition unit that acquires a request based on a voice input to the user interface, a response information acquisition unit that acquires response information corresponding to a request acquired by the request acquisition unit from a database, and a response acquired by the response information acquisition unit. It has a voice output processing unit that performs processing to output information to the user interface by voice, and a processing unit that performs processing related to switching to an automatic response by inputting / outputting text when the response information cannot be acquired from the database. An information providing system equipped with an information providing device.
  14.  宿泊施設に設置されたユーザインタフェースに入力された音声に基づくリクエストを取得するリクエスト取得部と、
     前記リクエスト取得部が取得したリクエストに対応する応答情報をデータベースから取得する応答情報取得部と、
     前記応答情報取得部が取得した応答情報を前記ユーザインタフェースに音声で出力させる処理を行う音声出力処理部と、
     前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答へ切り替えに係る処理を行う処理部と
     を備える、情報提供装置。
    A request acquisition unit that acquires requests based on voice input to the user interface installed in the accommodation facility,
    The response information acquisition unit that acquires the response information corresponding to the request acquired by the request acquisition unit from the database, and the response information acquisition unit.
    A voice output processing unit that performs a process of outputting the response information acquired by the response information acquisition unit to the user interface by voice, and a voice output processing unit.
    An information providing device including a processing unit that performs processing related to switching to an automatic response by input / output of text when the response information cannot be acquired from the database.
  15.  コンピュータに、
     宿泊施設に設置されたユーザインタフェースに入力された音声に基づくリクエストを取得し、
     取得したリクエストに対応する応答情報をデータベースから取得し、
     取得した応答情報を前記ユーザインタフェースに音声で出力させ、
     前記データベースから前記応答情報を取得できない場合に、テキストの入出力による自動応答へ切り替える、
     処理を実行させるコンピュータプログラム。
    On the computer
    Get a request based on the voice input to the user interface installed in the accommodation,
    Obtain the response information corresponding to the obtained request from the database and
    The acquired response information is output to the user interface by voice.
    If the response information cannot be obtained from the database, switch to automatic response by input / output of text.
    A computer program that executes processing.
  16.  コンピュータに、
     宿泊施設に設置されたユーザインタフェースが出力するアクセス用情報を取得し、
     取得したアクセス用情報に基づいて、前記宿泊施設に係る情報を提供する情報提供装置にアクセスし、
     テキスト入力によるリクエストを取得し、
     前記リクエストに対応する応答情報を前記情報提供装置から取得してテキスト出力し、
     前記情報提供装置からの制御に基づいて、有人応答への切り替えに係る操作を受け付ける、
     処理を実行させるコンピュータプログラム。
    On the computer
    Acquires access information output by the user interface installed in the accommodation facility,
    Based on the acquired access information, access the information providing device that provides information related to the accommodation facility, and
    Get a request by text input and
    The response information corresponding to the request is acquired from the information providing device and output as a text.
    Based on the control from the information providing device, the operation related to the switching to the manned response is accepted.
    A computer program that executes processing.
  17.  前記リクエストのテキスト入力及び前記応答情報のテキスト出力に係る言語を選択する、処理を実行させる請求項16に記載のコンピュータプログラム。
     
     
    The computer program according to claim 16, wherein the processing is selected by selecting a language related to the text input of the request and the text output of the response information.

PCT/JP2020/036093 2019-09-27 2020-09-24 Information providing method, information providing system, information providing device, and computer program WO2021060391A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2021507724A JP6920773B1 (en) 2019-09-27 2020-09-24 Information provision method, information provision system, information provision device and computer program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2019-177573 2019-09-27
JP2019177573 2019-09-27

Publications (1)

Publication Number Publication Date
WO2021060391A1 true WO2021060391A1 (en) 2021-04-01

Family

ID=75167012

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/036093 WO2021060391A1 (en) 2019-09-27 2020-09-24 Information providing method, information providing system, information providing device, and computer program

Country Status (2)

Country Link
JP (4) JP6920773B1 (en)
WO (1) WO2021060391A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023238333A1 (en) * 2022-06-09 2023-12-14 三菱電機株式会社 Facility management system, notification device, and program

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6920773B1 (en) * 2019-09-27 2021-08-18 Tradfit株式会社 Information provision method, information provision system, information provision device and computer program
KR102597695B1 (en) * 2021-11-30 2023-11-02 주식회사 코리아세븐 Method of providing service by using chatbot, system for providing service by using chatbot, and computer program for the method
JP7054571B1 (en) 2021-12-03 2022-04-14 Kotozna株式会社 Communication system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003337866A (en) * 2002-05-20 2003-11-28 Shimizu Corp Integrated system for managing room environment/ information
JP2008109595A (en) * 2006-10-27 2008-05-08 Nec Corp Automatic speech method switching system, server, method, and program
US20140257963A1 (en) * 2013-03-05 2014-09-11 Edward Dwight Landis Techniques for delivering hotel services
JP2015153015A (en) * 2014-02-12 2015-08-24 株式会社 ネットシスジャパン Online settlement system
JP2019139584A (en) * 2018-02-13 2019-08-22 コニカミノルタ株式会社 Sentence input apparatus, and program

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3550503B2 (en) * 1998-11-10 2004-08-04 インターナショナル・ビジネス・マシーンズ・コーポレーション Method and communication system for enabling communication
JP2001111696A (en) 1999-10-04 2001-04-20 Ryoichi Komai Call center system using internet speech
JP4550252B2 (en) 2000-10-04 2010-09-22 株式会社エム・エム・エヌ Sales support method
JP2005051554A (en) 2003-07-29 2005-02-24 Mitsubishi Electric Corp Customer terminal and operator terminal in responding call center system
JP2009200537A (en) * 2008-01-21 2009-09-03 Nonomura Denshi Giken:Kk Pay television billing system
US20140114706A1 (en) * 2009-11-23 2014-04-24 John Ford Blakely System and method for an automated concierge
JP5592690B2 (en) * 2010-05-07 2014-09-17 株式会社日本総合研究所 Voice communication method
US8732753B2 (en) * 2010-06-07 2014-05-20 Guest Tek Interactive Entertainment Ltd. Method of operating one or more controllable devices in dependence upon commands received from a plurality of mobile devices and system controller thereof
WO2012006675A1 (en) * 2010-07-14 2012-01-19 Lahris Pty Ltd Accomodation systems
JP6152293B2 (en) 2013-04-24 2017-06-21 日本エイ・ティー・エム株式会社 Automated transaction equipment operation support system
JP6351562B2 (en) * 2014-11-12 2018-07-04 株式会社アドバンスト・メディア Information processing system, reception server, information processing method, and program
JP2017049635A (en) * 2015-08-31 2017-03-09 長野テクトロン株式会社 Shop guide system
JP2017098822A (en) * 2015-11-26 2017-06-01 沖電気工業株式会社 Control device, program, and control system
JP2018081147A (en) * 2016-11-14 2018-05-24 シャープ株式会社 Communication device, server, control method and information processing program
JP6848693B2 (en) * 2017-05-30 2021-03-24 沖電気工業株式会社 Customer service systems, customer service methods, customer service devices, and programs
JP6787269B2 (en) 2017-07-21 2020-11-18 トヨタ自動車株式会社 Speech recognition system and speech recognition method
JP6531138B2 (en) * 2017-07-31 2019-06-12 株式会社L is B Call center support system and operator's client
JP6863179B2 (en) 2017-08-29 2021-04-21 沖電気工業株式会社 Call center system, call center device, dialogue method, and its program with customer complaint detection function
JP6947655B2 (en) 2018-01-23 2021-10-13 エヌ・ティ・ティ・コミュニケーションズ株式会社 Multi-cloud chat service providing device, multi-cloud chat service providing method and multi-cloud chat service providing program
WO2020240838A1 (en) * 2019-05-31 2020-12-03 富士通株式会社 Conversation control program, conversation control method, and information processing device
JP6920773B1 (en) * 2019-09-27 2021-08-18 Tradfit株式会社 Information provision method, information provision system, information provision device and computer program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003337866A (en) * 2002-05-20 2003-11-28 Shimizu Corp Integrated system for managing room environment/ information
JP2008109595A (en) * 2006-10-27 2008-05-08 Nec Corp Automatic speech method switching system, server, method, and program
US20140257963A1 (en) * 2013-03-05 2014-09-11 Edward Dwight Landis Techniques for delivering hotel services
JP2015153015A (en) * 2014-02-12 2015-08-24 株式会社 ネットシスジャパン Online settlement system
JP2019139584A (en) * 2018-02-13 2019-08-22 コニカミノルタ株式会社 Sentence input apparatus, and program

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023238333A1 (en) * 2022-06-09 2023-12-14 三菱電機株式会社 Facility management system, notification device, and program

Also Published As

Publication number Publication date
JPWO2021060391A1 (en) 2021-10-14
JP6920773B1 (en) 2021-08-18
JP7066235B2 (en) 2022-05-13
JP2021166102A (en) 2021-10-14
JP2022089968A (en) 2022-06-16
JP2023073412A (en) 2023-05-25

Similar Documents

Publication Publication Date Title
JP6920773B1 (en) Information provision method, information provision system, information provision device and computer program
US10686951B2 (en) Methods and systems for accessing printing and scanning functions of a multi-function printer through natural language text or voice messages
KR20190049886A (en) Automatic suggestion responses to images received in messages using the language model
CN111095892B (en) Electronic device and control method thereof
JP2021061027A (en) Resolving automated assistant request that is based on image and/or other sensor data
KR20090086805A (en) Self-evolving artificial intelligent cyber robot system
JP2017152948A (en) Information provision method, information provision program, and information provision system
US20210073476A1 (en) Recording medium, conversation control method, and information processing apparatus
JP2017142353A (en) Language learning device, language learning method, and language learning program
KR102216783B1 (en) System for providing order service for non-face-to-face and noncontact based on speech recognition
KR101731425B1 (en) Cloud system and method for displaying contents thereof
JP2024520659A (en) Method, apparatus and system for dynamically navigating an interactive communication system - Patents.com
CN117061487A (en) Graphical user interface for a speech responsive system
JP2019056956A (en) Information processing system, information processing program, information processing apparatus and information processing method
US11651059B2 (en) User account matching based on a natural language utterance
KR102494944B1 (en) Contents creating method and a system thereof
JP7250383B1 (en) SYSTEM FOR COMMUNICATION USING A TERMINAL DEVICE
KR102512470B1 (en) System for providing payment on behalf of user shoppingmall platform service for senior
JP7319639B1 (en) Voice input system and its program
JP7363862B2 (en) Information processing system, chatbot system, information management method, program
JP7205962B1 (en) System for automatic dialogue
US20240153529A1 (en) Information processing apparatus, information input support system, and non-transitory recording medium
JP2003030091A (en) Multilanguage operating system
JP2022129582A (en) Information processing system, operator terminal, and program
JP2021103843A (en) Server device for automatic response of water heater repair, automatic response system, and program for automatic response

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2021507724

Country of ref document: JP

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20867722

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20867722

Country of ref document: EP

Kind code of ref document: A1