WO2019156535A1 - Interactive ai agent system and method for actively providing order or reservation service on basis of monitoring of dialogue session between users by using previous history information in dialogue session, and computer-readable recording medium - Google Patents
Interactive ai agent system and method for actively providing order or reservation service on basis of monitoring of dialogue session between users by using previous history information in dialogue session, and computer-readable recording medium Download PDFInfo
- Publication number
- WO2019156535A1 WO2019156535A1 PCT/KR2019/001692 KR2019001692W WO2019156535A1 WO 2019156535 A1 WO2019156535 A1 WO 2019156535A1 KR 2019001692 W KR2019001692 W KR 2019001692W WO 2019156535 A1 WO2019156535 A1 WO 2019156535A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- reservation
- order
- conversation
- users
- user
- Prior art date
Links
- 230000002452 interceptive effect Effects 0.000 title claims abstract description 128
- 238000000034 method Methods 0.000 title claims abstract description 36
- 238000012544 monitoring process Methods 0.000 title claims abstract description 20
- 230000004044 response Effects 0.000 claims abstract description 47
- 238000004891 communication Methods 0.000 claims description 54
- 235000013305 food Nutrition 0.000 claims description 30
- 238000012545 processing Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 230000000007 visual effect Effects 0.000 description 7
- 238000007726 management method Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 5
- 238000012790 confirmation Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000006399 behavior Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000009223 counseling Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 235000012054 meals Nutrition 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/02—Reservations, e.g. for tickets, services or events
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/12—Hotels or restaurants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
Definitions
- TECHNICAL FIELD This disclosure relates to active monitoring and intervention for an online conversation session between users by an interactive AI agent system, and more specifically, an interactive AI agent system monitors an online conversation session between users and separates from the user. Without explicit and specific request, the present invention relates to a technology for actively providing an order or reservation service for a plurality of users participating in the conversation session, if actively determined as necessary.
- online conversation services eg, instant messenger services
- online conversations allow people to exchange conversations on a variety of topics in real time with one or more remote parties.
- online chat services have been increasingly combined with interactive AI agent systems to allow people to obtain desired information or services from interactive AI agent systems as if they were talking to friends through online chat messages. have.
- a method of providing an order or reservation service based on active monitoring of a conversation session between multiple users provided by an interactive AI agent system.
- the method of the present disclosure includes receiving user input from one or more of a plurality of users during a chat session, wherein the user input is a chat input to and from a plurality of users; Determining a predetermined user intent corresponding to the user input based on the predetermined knowledge base; Based on the received user intent, determining whether a user intent for a predetermined order or reservation for a predetermined number or more of the plurality of users has been obtained; In response to determining that a user intent wishing for a predetermined order or reservation for a predetermined number or more of users has been obtained, generating a conversation flow corresponding to the user intent; Retrieving records of orders or reservations provided on the chat session by the interactive AI agent system prior to any orders or reservations for the predetermined number of users; And performing a conversation and / or task for the given order or reservation based on
- a given order or reservation may be associated with one of food, goods, and services.
- Performing the conversation and / or task may include determining, from a record regarding the retrieved order or reservation, a company to provide one of the food, goods, and services associated with the given order or reservation.
- performing the conversation and / or task comprises determining each desired item for each of the predetermined number or more of users associated with the given order or reservation from a record relating to the retrieved order or reservation. It may include.
- performing the conversation and / or task may include determining payment information associated with the given order or reservation from a record regarding the retrieved order or reservation.
- performing the conversation and / or task may include determining address information associated with the given order or reservation from a record regarding the retrieved order or reservation.
- a given order or reservation is associated with one of food, goods and services
- performing a conversation and / or task comprises: food, goods and service associated with a given order or reservation
- Performing a conversation and / or task for determining a company to provide one of wherein performing the conversation and / or task for determining a company comprises: providing a list of a plurality of companies; Receiving a selection of one company from a list of a plurality of companies from each of the predetermined number of users; And determining the business based on a result of receiving the selection of the business from each of the predetermined number of users or more.
- performing the conversation and / or task comprises: performing a conversation and / or task for determining each desired item associated with a given order or reservation for each of a predetermined number of users. And performing a conversation and / or task for determining each desired item associated with the given order or reservation for each of the predetermined number of users includes: providing a list of a plurality of items; And receiving a selection of one item from a list of the plurality of items from each of the predetermined number of users.
- performing a conversation and / or a task includes proceeding with a payment associated with a given order or reservation, and proceeding with a payment associated with a given order or reservation, It may include the step of making a payment with respect to each of a predetermined number or more of the users or a representative of the predetermined number or more of the users.
- a computer readable recording medium including one or more instructions, wherein the one or more instructions, when executed for a computer, cause the computer to perform any one of the methods described above.
- a computer readable recording medium is provided.
- an interactive AI agent system configured to provide monitoring for a multi-user chat session and to provide active ordering or reservation service based thereon, comprises: a communication module; NLU module; And a conversation session monitoring / intervention module is provided.
- the communication module is configured to receive user input from one or more of the plurality of users during the chat session, wherein the user input is a conversation input to and from a plurality of users, and the NLU module And determine, based on a given knowledge base, a predetermined user intent corresponding to the user input, and the chat session monitoring / intervention module based on the received user intent, selecting a predetermined number or more of the plurality of users.
- a conversation corresponding to the user intent in response to determining that a user intent wishing for a predetermined order or reservation for the user has been obtained and determining that a user intent wishing for a predetermined order or reservation for a predetermined number of users has been obtained.
- Create a flow and place a predetermined order or reservation for a given number of users Retrieve a record about an order or reservation previously provided on a chat session by the interactive AI agent system, and based on the generated chat flow and the record of the retrieved order or reservation, a conversation and / or task for a given order or reservation It is configured to perform.
- the user does not have to actively open a separate conversation session or make a separate specific request for the interactive AI agent system. You can immediately get the information or services you want from them. According to the present disclosure, since the interactive AI agent system operates as if it is another human user participating in a conversation between users, the user can conveniently obtain appropriate services in a timely manner.
- FIG. 1 is a diagram schematically illustrating a system environment in which an interactive AI agent system may be implemented, according to an embodiment of the present disclosure.
- FIG. 2 is a functional block diagram schematically illustrating the functional configuration of the user terminal 102 of FIG. 1, according to an embodiment of the disclosure.
- FIG. 3 is a functional block diagram schematically illustrating the functional configuration of the interactive AI agent server 108 of FIG. 1, in accordance with an embodiment of the present disclosure.
- FIG. 4 provides for monitoring of a chat session between users through an online chat service server 106 and provision of an order / reservation service based thereon, performed by the interactive AI agent server 108, according to one embodiment of the disclosure.
- FIG. 5A through 5F illustrate that, according to one embodiment of the present disclosure, an interactive AI agent server 108 intervenes a conversation session between users through an online conversation service server 106 to perform a corresponding conversation session.
- This is a screen showing an exemplary chat window when a predetermined order service is provided on the screen.
- the module or the unit means a functional part that performs at least one function or operation, and may be implemented in hardware or software or in a combination of hardware and software. Also, a plurality of 'modules' or 'units' may be integrated by at least one software module and implemented by at least one processor, except for 'modules' or 'units', which need to be implemented by specific hardware. have.
- the interactive AI agent system may be a natural language input (eg, a natural language) input from a user through an interactive interaction with a user through a natural language in the form of voice and / or text.
- a natural language input eg, a natural language
- the dialogue response provided by the 'interactive AI agent system' may be in visual, auditory and / or tactile form (eg, voice, sound, text, video, image, symbol, emoticon, hyperlink, It may be included in various forms such as, but not limited to, animation, various knots, motion, haptic feedback, and the like.
- Tasks performed by the 'interactive AI agent system' in an embodiment of the present disclosure may include, for example, searching for information, placing an order, making a reservation or making a payment, writing an SMS message, writing an email, making a call, searching a user location, and the like. It may include various types of tasks (however, they are illustrative only and not limiting).
- FIG. 1 is a diagram schematically illustrating a system environment 100 in which an interactive AI agent system may be implemented, in accordance with an embodiment of the present disclosure.
- the system environment 100 includes a plurality of user terminals 102, a communication network 104, an online chat service server 106, an interactive AI agent server 108, and an external service server 110. Include.
- each of the plurality of user terminals 102 may be any user electronic device having a wired or wireless communication function.
- Each of the user terminals 102 may be a variety of wired or wireless communication terminals, including, for example, smart phones, tablet PCs, music players, smart speakers, desktops, laptops, PDAs, game consoles, digital TVs, set-top boxes, and the like. It should be understood that this is not limited.
- each of the user terminals 102 may communicate with the online chat service server 106, that is, transmit and receive necessary information through the communication network 104.
- each of the user terminals 102 establishes and establishes a chat session with one or a plurality of other user terminals 102 through the communication network 104 and the online chat service server 106.
- the conversation session may be used to exchange a real-time conversation with one or more other user terminals 102 in a voice and / or text manner.
- each of the user terminals 102 receives a user input in the form of voice and / or text from the outside, and establishes a conversation through the communication network 104 and the online chat service server 106. Through the session, the user input may be provided to another user terminal 102 participating in the conversation session.
- each of the user terminals 102 may communicate with the interactive AI agent server 108, that is, transmit and receive necessary information through the communication network 104.
- each of the user terminals 102 may communicate with the external service server 110, that is, transmit and receive necessary information through the communication network 104.
- each of the user terminals 102 may receive user input in the form of voice, text, and / or touch from the outside, and communicate with the interactive AI agent server 108 via the communication network 104. And / or an operation result corresponding to the user input (eg, providing a specific conversation response and / or performing a specific task) obtained through communication with the external service server 110 (and / or processing in the user terminal 102). Etc.) to the user.
- performing a task as an operation corresponding to a user input may be performed in various ways including, for example, searching for information, placing an order, making a reservation, or making a payment, writing an SMS message, writing an email, making a phone call, searching a user location, and the like. It may include performing (but not limited to) forms of tasks.
- each of the user terminals 102 may transmit a conversation response as a result of an operation corresponding to a user input in a visual, audio, and / or tactile form (eg, voice, sound, text, video, image, Symbols, emoticons, hyperlinks, animations, various knots, motion, haptic feedback, and the like, but are not limited thereto.
- communication network 104 may include any wired or wireless communication network, such as a TCP / IP communication network.
- the communication network 104 may include, for example, a Wi-Fi network, a LAN network, a WAN network, an Internet network, and the like, but the present invention is not limited thereto.
- the communication network 104 may include, for example, Ethernet, GSM, Enhanced Data GSM Environment (EDGE), CDMA, TDMA, OFDM, Bluetooth, VoIP, Wi-MAX, Wibro, and any other various wired or wireless. It may be implemented using a communication protocol.
- EDGE Enhanced Data GSM Environment
- the online chat service server 106 may communicate with the user terminal 102 via the communication network 104.
- the on-line chat service server 106 may be connected to the user terminal 102 and one or more other users according to a request received from the user terminal 102 through, for example, the communication network 104.
- a conversation session may be established between the terminals 102 and a real time conversation may be performed between the user terminals 102 participating in the conversation session.
- the online chat service server 106 may be an instant messenger service providing server.
- the online chat service server 106 when the online chat service server 106 establishes a chat session between the user terminals 102 in response to a request from the user terminal 102, the online chat service server 106 is connected to the chat session according to a user selection. It is possible to determine whether to use one or a plurality of interactive AI agent systems for each other. (In this embodiment, monitoring of a conversation session and service provision by the interactive AI agent system described later may be performed at the time of establishment of the conversation session. If selected to use the interactive AI agent system).
- user input in the form of voice and / or text received on each user terminal 102 is via a chat session established via communication network 104 and online chat service server 106. The data may be transferred to another user terminal 102 participating in the conversation session.
- the interactive AI agent server 108 may communicate with the user terminal 102 via the communication network 104.
- the interactive AI agent server 108 receives user input in the form of voice, text, and / or touch from the user terminal 102, for example, via the communication network 104, and has prepared knowledge in advance. Based on the base models, the received user input can be processed to determine the user's intent.
- the interactive AI agent server 108 may generate and send to the user terminal 102 a specific conversation response that corresponds, for example, to the user intent.
- the interactive AI agent server 108 monitors a chat session between the plurality of user terminals 102 via the online chat service server 106, for example, via the communication network 104. , Each user natural language input transmitted and received between the user terminals 102 on the conversation session. According to an embodiment of the present disclosure, the interactive AI agent server 108 may determine the intent of each user by processing the received natural language input based on previously prepared knowledgebase models.
- the interactive AI agent server 108 may, based on a user intent obtained from a chat session between the user terminals 102 via the online chat service server 106, determine a predetermined criterion. As a result, it may be determined whether an order or reservation service for part or all of the user terminal 102 participating in the conversation session is required. According to one embodiment of the present disclosure, when the interactive AI agent server 108 determines that the order or reservation service is required, the interactive AI agent server 108 generates a corresponding conversation and task flow, intervenes in the corresponding conversation session, and generates the conversation. Conversations and task performance can be provided.
- the interactive AI agent server 108 generates a conversation response corresponding to the user intent determined above in voice and / or text form, and generates the generated response in the communication network 104. Via the user terminal 102 and / or the online chat service server 106.
- the conversation response generated by the interactive AI agent server 108 may be accompanied by other visual elements such as images, videos, symbols, and emoticons, in addition to the aforementioned natural language responses in the form of voice and / or text. Or other acoustic elements such as sound, sound, or other tactile elements.
- the same type of response is generated on the interactive AI agent server 108, depending on the type of user input received on the user terminal 102 (eg, voice input or text input).
- a voice response is generated if a voice input is given and a text response is generated if a text input is given
- responses in the form of voice and / or text may be generated and provided regardless of the form of user input.
- the interactive AI agent server 108 may allow a task corresponding to the determined user intent to be performed.
- the interactive AI agent server 108 intervenes in a chat session between user terminals 102 via, for example, an online chat service server 106 to match a user intent on that chat session.
- a specific task performing (eg, order or reservation) procedure may be performed.
- the interactive AI agent server 108 may connect to the external service server 110 through the communication network 104 to perform a specific task that matches the user intent.
- the interactive AI agent server 108 may communicate with the external service server 110 via the communication network 104, as mentioned above.
- the external service server 110 may be, for example, various online reservation center servers, various food order service servers, online shopping mall servers, information retrieval servers, various payment agency servers, location search servers, and the like, but the present disclosure is not limited thereto.
- the response based on the user intent which is passed from the interactive AI agent server 108 to the user terminal 102 and / or the online chat service server 106, may be, for example, an external service server ( It should be understood that the data content may include data content retrieved and obtained from 110.
- the interactive AI agent server 108 is shown as a separate physical server configured to communicate with the external service server 110 through the communication network 104, but the present disclosure is not limited thereto.
- the interactive AI agent server 110 may include, for example, various online reservation center servers, various food order service servers, online shopping mall servers, information retrieval servers, various payment agency servers, location search servers, and the like. It should be understood that it may be included as part of the service server.
- the user terminal 102 includes a user input receiving module 202, a sensor module 204, a program memory module 206, a processing module 208, a communication module 210, and a response output module ( 212).
- the user input receiving module 202 may input various types of inputs from a user, for example, natural language inputs such as voice inputs and / or text inputs (and additionally other types of inputs such as touch inputs). ) Can be received.
- the user input receiving module 202 may include, for example, a microphone and an audio circuit, and may acquire a user voice input signal through the microphone and convert the obtained signal into audio data.
- the user input receiving module 202 may include various pointing devices such as a mouse, a joystick, a trackball, a keyboard, a touch panel, a touch screen, a stylus, and the like.
- the user may acquire text input and / or touch input signals input from the user through these input devices.
- the user input received by the user input receiving module 202 may be associated with performing a predetermined task, for example, executing a predetermined application or retrieving predetermined information, but the present disclosure is limited thereto. It is not.
- the sensor module 204 includes one or more different types of sensors, through which the status information of the user terminal 102, such as the physical state of the corresponding user terminal 102, The software and / or hardware state or information about the environment state of the user terminal 102 may be obtained.
- the sensor module 204 may include, for example, an optical sensor, and detect an ambient light state of the corresponding user terminal 102 through the optical sensor.
- the sensor module 204 may include, for example, a movement sensor, and detect whether the corresponding user terminal 102 is moved through the movement sensor.
- the sensor module 204 may include, for example, a speed sensor and a GPS sensor, and may detect a position and / or orientation state of the corresponding user terminal 102 through these sensors. According to another embodiment of the present disclosure, it should be appreciated that the sensor module 204 may include other various types of sensors, including temperature sensors, image sensors, pressure sensors, contact sensors, and the like.
- the program memory module 206 may be any storage medium in which various programs, for example, various application programs and related data, which may be executed on the user terminal 102, are stored.
- program memory module 206 includes various application programs, including, for example, instant messaging applications, dialing applications, email applications, map applications, web browser applications, payment and / or remittance applications, and the like. Data related to the execution of these programs can be stored.
- the program memory module 206 may be configured to include various types of volatile or nonvolatile memory such as DRAM, SRAM, DDR RAM, ROM, magnetic disk, optical disk, flash memory, and the like. .
- the processing module 208 may communicate with each component module of the user terminal 102 and perform various operations on the user terminal 102. According to one embodiment of the present disclosure, the processing module 208 may drive and execute various application programs on the program memory module 206. According to one embodiment of the present disclosure, the processing module 208 may receive the signals obtained by the user input receiving module 202 and the sensor module 204 and perform appropriate processing on these signals, if necessary. have. According to one embodiment of the present disclosure, the processing module 208 may perform appropriate processing on a signal received from the outside through the communication module 210 if necessary.
- the communication module 210 is configured such that the user terminal 102 is connected to the online chat service server 106, the interactive AI agent server 108, and / or via the communication network 104 of FIG. 1. Or to communicate with an external service server 110.
- the communication module 210 may be configured such that, for example, signals acquired on the user input receiving module 202 and the sensor module 204 may be connected to the online chat service server through the communication network 104 according to a predetermined protocol.
- 106, interactive AI agent server 108 and / or external service server 110 may be configured such that, for example, signals acquired on the user input receiving module 202 and the sensor module 204 may be connected to the online chat service server through the communication network 104 according to a predetermined protocol.
- the communication module 210 receives from the online chat service server 106, the interactive AI agent server 108 and / or the external service server 110, for example, via the communication network 104.
- Various signals for example, response signals including natural language responses in the form of voice and / or text, various control signals, and the like, may be received, and appropriate processing may be performed according to a predetermined protocol.
- the response output module 212 may output a response corresponding to a user input in various forms such as visual, auditory, and / or tactile.
- the response output module 212 includes various display devices such as a touch screen based on technologies such as LCD, LED, OLED, QLED, etc., and responds to user input through these display devices.
- Visual responses such as text, symbols, videos, images, hyperlinks, animations, various notes, and the like, to the user.
- the response output module 212 includes, for example, a speaker or a headset, and provides an audible response, such as a voice and / or acoustic response, to the user via the speaker or the headset, corresponding to a user input. can do.
- the response output module 212 may include a motion / haptic feedback generator, thereby providing a tactile response, eg, motion / haptic feedback, to a user.
- the response output module 212 can simultaneously provide any two or more combinations of text response, voice response, and motion / haptic feedback corresponding to user input.
- the interactive agent server 108 may include a communication module 302, a speech-to-text (STT) module 304, a natural language understanding (NLU) module 306. ), Conversation understanding knowledge base 308, user database 310, conversation session monitoring / intervention 312, conversation management module 314, conversation generation module 316, and speech synthesis (Text-To-Speech; TTS) module 318.
- STT speech-to-text
- NLU natural language understanding
- Conversation understanding knowledge base 308 user database 310
- conversation session monitoring / intervention 312 conversation management module 314, conversation generation module 316, and speech synthesis (Text-To-Speech; TTS) module 318.
- TTS speech synthesis
- the communication module 302 is configured such that the interactive AI agent server 108 communicates with the user terminal 102 via the communication network 104 according to a predetermined wired or wireless communication protocol. Enable communication with service server 106 and / or external service server 110.
- the communication module 302 is a user input (eg, touch input, transmitted from the user terminal 102 and / or the online chat service server 106) via the communication network 104. Voice input and / or text input, and the like, but not limited thereto).
- the communication module 302 is transmitted from the user terminal 102 or the like via the communication network 104, together with or separately from the user input described above, to the user terminal 102.
- the state information may include, for example, various state information related to the corresponding user terminal 102 at the time of the user input (for example, the physical state of the user terminal 102 and the software of the user terminal 102). And / or hardware status, environmental status information around the user terminal 102, and the like.
- the communication module 302 may further include a conversation response generated by the interactive AI agent server 108 in response to the received user input (eg, natural language in the form of voice and / or text). Chat response, etc.) and / or control signals, may be performed through the communication network 104, appropriate measures necessary to communicate to the user terminal 102 and / or the online chat service server 106.
- the STT module 304 may receive a voice input among user inputs received through the communication module 302 and convert the received voice input into text data based on pattern matching or the like. have. According to one embodiment of the present disclosure, the STT module 304 may generate a feature vector sequence by extracting features from a user's voice input. According to an embodiment of the present disclosure, the STT module 304 may include a dynamic time warping (DTW) method, a HMM model (Hidden Markov Model), a GMM model (Gaussian-Mixture Mode), a deep neural network model, an n-gram model, and the like. Based on a variety of statistical models of, a text recognition result, such as a sequence of words, can be generated. According to an embodiment of the present disclosure, the STT module 304 may refer to each user characteristic data of the user database 310 to be described later when converting the received voice input into text data based on pattern matching. .
- DTW dynamic time warping
- the NLU module 306 may receive a text input from the communication module 302 or the STT module 304.
- text input received at the NLU module 306 is received from the user terminal 102 and / or the online chat service server 106 via the communication network 104, for example, in the communication module 302. It may be a text recognition result, eg, a sequence of words, generated at STT module 304 from a user text input that was received, or from a user voice input received at communication module 302.
- the status information may include, for example, various status information related to the corresponding user terminal 102 at the time of user voice input and / or text input (eg, physical state of the user terminal 102 and software). And / or hardware status, environmental status information around the user terminal 102, and the like.
- the NLU module 306 may map the received text input to one or more user intents based on the conversation understanding knowledge base 308 described below.
- a user intent here may be associated with a series of action (s) that can be understood and performed by the interactive AI agent server 108 in accordance with the user intent.
- the NLU module 306 may refer to the aforementioned state information in mapping the received text input to one or more user intents.
- the NLU module 306 may reference each user characteristic data of the user database 310 described below in mapping the received text input to one or more user intents.
- conversation understanding knowledge base 308 may include, for example, a predefined ontology model.
- the ontology model may be represented, for example, in a hierarchical structure between nodes, each node being a "intent" node or a child linked to an "intent" node corresponding to the user's intent. It may be one of the "attribute" nodes.
- the ontology model may be dynamically changed by adding or deleting nodes, or modifying relationships between nodes.
- the conversation understanding knowledge base 308 may be configured to include domains corresponding to all intents, for example, that the interactive AI agent system can understand and perform operations corresponding thereto. Can be.
- intent nodes and attribute nodes of each domain in the ontology model may be associated with words and / or phrases related to corresponding user intents or attributes, respectively.
- the dialogue understanding knowledge base 308 comprises an ontology model consisting of nodes in a hierarchy and a set of words and / or phrases associated with each node, for example in the form of a lexical dictionary (specific (Not shown), the NLU module 306 may determine the user intent based on the ontology model implemented in the form of a lexical dictionary.
- the NLU module 306 when the NLU module 306 receives a text input or a sequence of words, the NLU module 306 may determine which words in which domain in the ontology model are associated with each word in the sequence, Based on such a determination, the corresponding domain, ie user intent, can be determined.
- the user database 310 may be a database that stores and manages characteristic data for each user.
- the characteristic data for each user included in the user database 310 may include, for example, a previous conversation / behavior record of the user for each user, a previous order or reservation record for the user (eg, Previous orders or booking company / item information, shipping address information of previous orders or foods, payment information entered by the user, and other records related to previous orders or reservations), user pronunciation feature information, user vocabulary preferences, It may include the user's location, setting language, contact / friend list, and various other user characteristic information.
- the STT module 304 refers to each user characteristic data of the user database 310, for example, a pronunciation characteristic for each user, when converting a voice input into text data. You can get more accurate text data.
- the NLU module 306 determines more accurate user intent by referring to each user characteristic data of the user database 310, such as each user's characteristic or context, when determining the user intent. can do.
- the conversation generating module 316 may refer to user characteristic data of the user database 310 when generating a conversation response.
- a user database 310 for storing and managing characteristic data for each user is illustrated as being arranged in the interactive AI agent server 108, but the present invention is not limited thereto. According to another embodiment of the present disclosure, a user database 310 for storing and managing characteristic data for each user may exist, for example, in the user terminal 102, and the user terminal 102 and the interactive AI agent server 108. It should be noted that they may be distributed in
- the chat session monitoring / intervention 312 is configured for each chat item on the chat session established between the plurality of user terminals 102 via the online chat service server 106, that is, each user.
- the input can be monitored.
- the chat session monitoring / intervention 312 is provided from each user terminal 102 on a chat session between the plurality of user terminals 102 via the online chat service server 106. Based on the conversation item, i.e., each user input, each corresponding user intent obtained from the NLU module 306 may be obtained.
- the conversation session monitoring / intervention unit 312 is based on the user intent and according to a predetermined criterion, the situation in which the interactive AI agent server 108 should intervene in the conversation session. Can be determined.
- the interactive AI agent server 108 may include one or more user intents corresponding to one or more user inputs obtained, for example, on a conversation session between the plurality of user terminals 102. If it is desired to execute a given order or reservation for all or some of the (102) (but this disclosure is not so limited), it is determined in advance that it must intervene in the conversation session to provide the necessary services. You can put it.
- the conversation management module described below. 314 and the conversation response generated by the conversation creation module 316 or the like may be provided on the corresponding conversation session through the online conversation service server 106.
- the conversation management module 314 may generate a series of operation flows corresponding thereto according to the user intent determined by the NLU module 306. According to one embodiment of the present disclosure, the conversation management module 314 is based on a certain conversation flow management model, for example, in response to a user intent received from the NLU module 306, for example, any conversation progress and And / or determine whether to perform a task execution and generate a detailed operational flow accordingly.
- the conversation generation module 316 may generate a conversation response to be provided to the user based on the conversation flow generated by the conversation management module 314.
- the conversation generating module 316 may, in the generation of a conversation response, present the user characteristic data (eg, a previous conversation / behavior record of the user, the user, etc.) of the user database 310 described above. Prior order or reservation records, user's pronunciation feature information, user's vocabulary preferences, user's location, setting language, contact / friend list, and various other user characteristic information).
- the TTS module 318 may receive a conversation response generated to be sent to the user terminal 102 by the conversation generation module 316.
- the conversation response received at the TTS module 318 may be a natural language or a sequence of words in text form.
- the TTS module 318 may convert the received text form input into a speech form according to various types of algorithms.
- the interactive AI agent system includes a user terminal 102 and an interactive AI agent server 108 in a client-server model, in particular a client only with user input / output functions. All other functions of the interactive AI agent system are provided as described and implemented based on a so-called "thin client-server model," which is delegated to a server, but the invention is not so limited. According to another embodiment of the present disclosure, the interactive AI agent system should be aware that the functions may be implemented by being distributed between the user terminal and the server, or alternatively, may be implemented as a standalone application installed on the user terminal. .
- the interactive AI agent system distributes the functions between the user terminal and the server according to an embodiment of the present disclosure
- the distribution of each function of the interactive AI agent system between the client and the server may vary from embodiment to embodiment. It should be appreciated that other implementations may be made.
- a specific module has been described as performing certain operations, but the present invention is not limited thereto. According to another embodiment of the present disclosure, it should be understood that the operations described as performed by any particular module in the above description may be performed by a separate module from each other.
- FIGS. 4 and 5 provides for monitoring of a chat session between users and an order / reservation service accordingly through an online chat service server 106, performed by the interactive AI agent server 108, according to one embodiment of the disclosure.
- 5A through 5E illustrate that, according to an embodiment of the present disclosure, the interactive AI agent server 108 intervenes a conversation session between users through an online conversation service server 106 to perform a corresponding conversation session.
- This is a screen showing an exemplary chat window when a predetermined order service is provided on the screen.
- FIGS. 4 and 5 an exemplary operation flow of the interactive AI agent server 108 according to an embodiment of the present disclosure will be described with reference to FIGS. 4 and 5.
- step 402 the interactive AI agent server 108, on a chat session established between a plurality of user terminals 102 via an online chat service server 106, each user terminal.
- Each user input from 102 may be received.
- the received user input may be a text and / or voice input received on each corresponding user terminal 102.
- the interactive AI agent server 108 may analyze each user input obtained at step 402 and determine each corresponding user intent.
- the interactive AI agent server 108 may map each user input to one or more user intentions based on a predetermined knowledge base.
- the interactive AI agent server 108 may be specialized for providing a service in the field of ordering or reservation. In such a case, the knowledge base of the interactive AI agent server 108 is specialized in that area of expertise and may allow user intentions related to that field to be more accurately determined.
- the interactive AI agent server 108 based on each user intent obtained, has a predetermined number or more (all participants or some participants) among the user terminals 102 participating in the corresponding conversation session. It may be determined whether an intention to desire a predetermined order or reservation for the user terminals 102 has been obtained, and based on that, it may be determined whether to participate in the conversation session to start providing the order or reservation service.
- the interactive AI agent server 108 includes a plurality of participants participating in the conversation session, for example, on a conversation session between the plurality of user terminals 102 via the online conversation service server 106.
- the interactive AI agent server 108 intends to order a delivery food for lunch for all users participating in the conversation session. If it is obtained, it may be determined to start the order service of the delivery food for the whole user by intervening the corresponding conversation session.
- the intention to desire a predetermined order or reservation for a predetermined number of users may be explicitly obtained from each user, but from the interpretation of the intent of some of the parties participating in the chat session. It should be noted that it may be implicitly obtained (eg, a particular user on a chat session may represent multiple users representing their intention to wish for a common order or reservation together).
- step 406 If it is determined in step 406 that the intention to wish for a predetermined order or reservation for a predetermined number or more of the user terminals 102 has not yet been obtained, then the procedure returns to step 402 and the interactive AI agent server 108. ) Can continue to monitor the conversation afterwards, providing no response.
- the interactive AI agent server 108 has acquired the intention to wish a predetermined order or reservation for a predetermined number or more of the user terminals 102 and thus participate in the conversation session to place the order. Or if it is determined to start providing the reservation service, the procedure proceeds to step 408 and the interactive AI agent server 108 proceeds to the flow of operations (providing a conversation and / or performing a task) for the corresponding predetermined order or reservation. And the like).
- the interactive AI agent server 108 may, in connection with a corresponding order or reservation, determine who will be the provider of food, goods, or services that are the subject of the order or reservation. You can create an operational flow.
- the interactive AI agent server 108 in connection with a corresponding order or reservation, from each user terminal 102 who wished to place the order or reservation, food related to the order or reservation, An operation flow may be created that determines specific items (kinds) of goods or services.
- the interactive AI agent server 108 may generate an operational flow for proceeding with payment for the order or reservation in relation to the corresponding order or reservation.
- step 410 the interactive AI agent server 108, between the user terminals 102 through the online chat service server 106, in accordance with the predetermined operational flow generated in step 408 above.
- a chat and / or task may be performed to determine who will be the provider of the food, goods or service that is the subject of that order or reservation.
- the interactive AI agent server 108 decides to initiate an ordering service of a delivery meal for a plurality of users who participated in a conversation session, for example, intervening on the conversation session, Prompt selection of restaurants to order food and receive and collect responses from user (s) thereon.
- the interactive AI agent server 108 provides a list of restaurants that can order food for delivery, e.g., intervening on a corresponding chat session, and from each user terminal 102 each. Receive and collect vendor choices. According to one embodiment of the present disclosure, the interactive AI agent server 108 selects from each user terminal 102, for example, by first searching for vendors capable of ordering delivery food at the current location and providing the searched results on the corresponding chat session. Can receive and collect.
- the interactive AI agent server 108 determines whether there is a record of a previously performed order or reservation on the conversation session, and that there is a record of a previously performed order or reservation. In the case of a determination, priority is given to the company information on the record (ie, the food, goods or service subject to the order or reservation in the record of the order or reservation previously performed by the interactive AI agent server 108 on the conversation session). Provider information, etc.) may be presented in a simple visual form (e.g., various forms including lists, buttons, etc.).
- the interactive AI agent server 108 first presents each vendor information associated with a previously performed order or reservation on the corresponding conversation session, thereby allowing a user to select one of the presented vendor information. If there is a business that you want to use again for this order or reservation, you may be urged to select that business (eg, a selection or button on the list about that business).
- the interactive AI agent server 108 may display a plurality of company information related to an order or reservation previously performed on the conversation session. While presenting, the user on the corresponding conversation session may be urged to select if there is a company information to be used for this order or reservation among the plurality of company information presented.
- the interactive AI agent server 108 may provide items that can be selected when the user wishes to select a new company.
- step 412 the interactive AI agent server 108 participates in each order or reservation.
- a conversation and / or task may be performed for the user terminal 102 to identify each specific order or item of the reservation (eg, the specific item of the food to be ordered, the specific item of the service to be booked, etc.).
- the interactive AI agent server 108 may request each order or reservation for a plurality of users participating in the chat session. , Prompting the user to select a product or item of service and receiving and collecting each answer from the user terminals 102 thereon.
- the interactive AI agent server 108 provides a list of items for ordering or booking, for example, intervening in a corresponding chat session, and selecting each item from each user terminal 102. Can receive and collect.
- the interactive AI agent server 108 may, for example, determine an order or reservation previously performed in determining a provider of food, goods or services subject to the order or reservation in step 410. If the associated business is selected, the corresponding item information in the record of the previous order or reservation for that business (ie, food, merchandise that was the subject of the order or reservation in a previous order or reservation for that business). Or each specific item information of the service) in a simple visual form on the corresponding conversation session. According to one embodiment of the present disclosure, the interactive AI agent server 108 preferentially presents item information associated with a previously performed order or reservation on the conversation session, thereby allowing each user to present the presented item information. You can ask if you want to use it again for this order or reservation.
- the interactive AI agent server 108 may correspond to item information related to an order or reservation previously performed with respect to a specific company. While presenting on the chat session, each user on the chat session can be selected to use the presented item information for the current order or reservation or to select a new item. According to one embodiment of the present disclosure, for example, as shown in FIG. 5C, the interactive AI agent server 108 may order lunch delivery food for all user terminals 102 participating in a conversation session. In the case of using the Chinese restaurant OO, which has been used previously, it is possible to present item information for each user at the time of the order previously performed for the Chinese restaurant OO in the conversation session. As illustrated in FIG.
- the interactive AI agent server 108 receives, for example, the use of existing item information from users A and B, but selects another item from user C.
- the menu of the Chinese restaurant OO may be presented on the conversation session, and the desired food item may be received from the user C.
- the procedure proceeds to step 414 and the interactive AI agent server 108, if necessary, one or more From the user terminal 102 can receive address information about the order or reservation.
- the interactive AI agent server 108 prioritizes address information associated with a previously performed order or reservation on the conversation session, as shown in FIG. And ask each user if they wish to modify the presented address information.
- the procedure proceeds to step 416 and the interactive AI agent server 108 is placed on the order or reservation received so far. It is possible to obtain confirmation from one or more user terminals 102 by presenting options and the like.
- the interactive AI agent server 108 will order food from Chinese restaurant OO for users A to C, and for user A 1 Jajang-myeon, for user B 2 fried rice, for user C 3 champon will be ordered, and the address can be checked to see if it is XXX in Dongjak-gu, Seoul.
- the interactive AI agent server 108 may omit the procedure for obtaining confirmation from the user terminals 102 of step 416.
- step 416 if a confirmation regarding the order or reservation is obtained from the user terminal 102, the procedure proceeds to step 418, and the interactive AI agent server 108 makes a payment regarding the order or reservation. You can proceed.
- the interactive AI agent server 108 may provide options for payment on a chat session, for example, as shown in FIG. You can proceed with the required payment process.
- payment for an order or a reservation for a plurality of users may be performed for each user, or for one representative for all users (see FIG. 5 (f)). It should be understood that the invention is not limited to any particular example.
- a subsequent transfer process of an amount related to itself from each user to the representative may be separately performed, but the present invention is limited thereto. It doesn't happen.
- the payment process may be made through a connection to a separate server for payment or a separate app execution, and the like, and is not limited to the specific embodiment of the present disclosure. do.
- a person to make a payment is determined (for example, payment by one representative)
- a user database about a person who is to make the payment is searched and the payment information used for a previous order or reservation (for example, it is possible to determine whether card information or the like exists, and if such information exists, the user can be queried whether to use existing payment information again.
- the interactive AI agent server 108 retrieves existing payment information (eg, card information) for the representative A.
- the representative A may be asked to select whether or not to use the existing payment information. As shown in (f) of FIG.
- the representative A has chosen to use the existing payment information, and the payment process proceeds using the existing information.
- the representative A may notify that the new payment information is to be used and proceed with the payment procedure accordingly.
- the interactive AI agent server 108 depends only on its own judgment (eg, whether the user intention requires the provision of an order or reservation service by the interactive AI agent server 108). It should be noted that the present invention is described as determining whether to intervene in a conversation session between users, but the present invention is not limited thereto. According to another embodiment of the present disclosure, the interactive AI agent server 108 does not communicate when the user calls a predetermined name (eg, a name predetermined to refer to the corresponding interactive AI agent) during the conversation session. Intervene in the session to proceed with the order or reservation.
- a predetermined name eg, a name predetermined to refer to the corresponding interactive AI agent
- the interactive AI agent server 108 since the interactive AI agent server 108 has continuously performed monitoring of each user input of the corresponding chat session, it means that the user receives the final confirmation of the intervention into the chat session by simply calling the user. It should be noted that the user does not need to explicitly request the desired service from the interactive AI agent server 108 again.
- the interactive AI agent server 108 when the interactive AI agent server 108 provides an order or reservation service for a user, it sequentially checks the matters related to each company, item, and payment related to the order or reservation service.
- the present disclosure is not limited thereto.
- the agent server 108 may omit the procedure of confirming the matter regarding the company and the item, and proceed with the task according to the record of the user.
- the description has been mainly focused on an instant message in which a conversation content of a conversation session is presented on a screen, but the present invention is not limited thereto. It should be appreciated that the conversation session of the present invention may be based on real time voice conversations between users, for example via intelligent speakers.
- a computer program may include a storage medium readable by a computer processor or the like, such as an EPROM, an EEPROM, a nonvolatile memory such as a flash memory device, a magnetic disk such as an internal hard disk and a removable disk, a magneto-optical disk, and It may be implemented in a form stored in various types of storage media, including a CDROM disk.
- the program code (s) may be implemented in assembly or machine language. All modifications and changes that fall within the true spirit and scope of the present invention are intended to be covered by the following claims.
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Marketing (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Theoretical Computer Science (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Primary Health Care (AREA)
- General Health & Medical Sciences (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (10)
- 대화형 AI 에이전트 시스템에 의해 제공되는, 복수 사용자간 대화 세션의 능동적 모니터링에 기초한 주문 또는 예약 서비스 제공 방법으로서,A method of providing an order or reservation service based on active monitoring of a conversation session between multiple users, provided by an interactive AI agent system,상기 대화 세션 중에 상기 복수 사용자 중 하나 이상의 사용자로부터의 사용자 입력 - 상기 사용자 입력은 상기 복수 사용자 간에 주고받는 대화 입력임 - 을 수신하는 단계;Receiving user input from one or more of the plurality of users during the chat session, wherein the user input is a chat input to and from the plurality of users;소정의 지식베이스에 기초하여, 상기 사용자 입력에 대응하는 소정의 사용자 인텐트를 결정하는 단계;Determining a predetermined user intent corresponding to the user input based on a predetermined knowledge base;상기 수신된 사용자 인텐트에 기초하여, 상기 복수 사용자 중 소정 수 이상의 사용자를 위한 소정의 주문 또는 예약을 희망하는 사용자 인텐트가 획득되었는지 판정하는 단계; Based on the received user intent, determining whether a user intent for a given order or reservation for a predetermined number or more of the plurality of users has been obtained;상기 소정 수 이상의 사용자를 위한 상기 소정의 주문 또는 예약을 희망하는 사용자 인텐트가 획득되었다고 판정함에 응답하여, 상기 사용자 인텐트에 부합하는 대화 흐름을 생성하는 단계; In response to determining that a user intent wishing for the predetermined order or reservation for the predetermined number of users has been obtained, generating a conversation flow corresponding to the user intent;상기 소정 수 이상의 사용자를 위한 상기 소정의 주문 또는 예약 이전에 상기 대화형 AI 에이전트 시스템에 의하여 상기 대화 세션 상에서 제공된 주문 또는 예약에 관한 기록을 검색하는 단계; 및Retrieving a record of an order or reservation provided on the conversation session by the interactive AI agent system prior to the predetermined order or reservation for the predetermined number of users; And상기 생성된 대화 흐름 및 상기 검색된 주문 또는 예약에 관한 기록에 기초하여, 상기 소정의 주문 또는 예약을 위한 대화 및/또는 태스크를 수행하는 단계Performing a conversation and / or task for the given order or reservation based on the generated conversation flow and the record regarding the retrieved order or reservation를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.Including, an active order or reservation service providing method.
- 제1항에 있어서,The method of claim 1,상기 소정의 주문 또는 예약은, 음식, 상품 및 서비스 중 하나와 연관되고,The predetermined order or reservation is associated with one of food, goods and services,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 검색된 주문 또는 예약에 관한 기록으로부터 상기 소정의 주문 또는 예약과 연관된 음식, 상품 및 서비스 중 하나를 제공할 업체를 결정하는 단계를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.Performing the conversation and / or task includes determining from a record regarding the retrieved order or reservation a company to provide one of the food, goods and services associated with the given order or reservation. Or how to provide reservation services.
- 제1항에 있어서,The method of claim 1,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 검색된 주문 또는 예약에 관한 기록으로부터 상기 소정의 주문 또는 예약과 연관된, 상기 소정 수 이상의 사용자 각각을 위한 각 희망 품목을 결정하는 단계를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.Performing the conversation and / or task includes actively determining each desired item for each of the predetermined number or more of users associated with the given order or reservation from a record relating to the retrieved order or reservation. How to provide order or reservation service.
- 제1항에 있어서,The method of claim 1,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 검색된 주문 또는 예약에 관한 기록으로부터 상기 소정의 주문 또는 예약과 연관된 결제 정보를 결정하는 단계를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.And performing the conversation and / or task comprises determining payment information associated with the given order or reservation from a record of the retrieved order or reservation.
- 제1항에 있어서,The method of claim 1,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 검색된 주문 또는 예약에 관한 기록으로부터 상기 소정의 주문 또는 예약과 연관된 주소 정보를 결정하는 단계를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.And performing the conversation and / or task comprises determining address information associated with the given order or reservation from a record of the retrieved order or reservation.
- 제1항에 있어서,The method of claim 1,상기 소정의 주문 또는 예약은, 음식, 상품 및 서비스 중 하나와 연관되고,The predetermined order or reservation is associated with one of food, goods and services,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 소정의 주문 또는 예약과 연관된 음식, 상품 및 서비스 중 하나를 제공할 업체를 결정하기 위한 대화 및/또는 태스크를 수행하는 단계를 포함하며,Performing the conversation and / or task includes performing a conversation and / or task to determine a company that will provide one of the food, goods and services associated with the given order or reservation,상기 업체를 결정하기 위한 대화 및/또는 태스크를 수행하는 단계는, 복수의 업체의 리스트를 제공하는 단계;Performing a conversation and / or a task for determining a business comprises: providing a list of a plurality of business;상기 소정 수 이상의 사용자 각각으로부터 상기 복수의 업체의 리스트 중 하나의 업체의 선택을 수신하는 단계; 및Receiving a selection of one company from the list of companies from each of the predetermined number of users; And상기 소정 수 이상의 사용자 각각으로부터의 상기 업체의 선택을 수신한 결과에 기초하여, 상기 업체를 결정하는 단계Determining the business based on a result of receiving the selection of the business from each of the predetermined number of users or more;를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.Including, an active order or reservation service providing method.
- 제1항에 있어서,The method of claim 1,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 소정 수 이상의 사용자 각각의, 상기 소정의 주문 또는 예약과 연관된 각 희망 품목을 결정하기 위한 대화 및/또는 태스크를 수행하는 단계를 포함하고,Performing the conversation and / or task includes performing a conversation and / or task for determining each desired item associated with the predetermined order or reservation for each of the predetermined number of users,상기 소정 수 이상의 사용자 각각의, 상기 소정의 주문 또는 예약과 연관된 각 희망 품목을 결정하기 위한 대화 및/또는 태스크를 수행하는 단계는, Performing a conversation and / or task for each of the predetermined number or more of users to determine each desired item associated with the given order or reservation,복수의 품목의 리스트를 제공하는 단계; 및 Providing a list of a plurality of items; And상기 소정 수 이상의 사용자 각각으로부터 상기 복수의 품목의 리스트 중 하나의 품목의 선택을 수신하는 단계Receiving a selection of an item from the list of the plurality of items from each of the predetermined number of users를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.Including, an active order or reservation service providing method.
- 제1항에 있어서,The method of claim 1,상기 대화 및/또는 태스크를 수행하는 단계는, 상기 소정의 주문 또는 예약과 연관된 결제를 진행하는 단계를 포함하고,Performing the conversation and / or task includes making a payment associated with the predetermined order or reservation,상기 소정의 주문 또는 예약과 연관된 결제를 진행하는 단계는, 상기 소정 수 이상의 사용자 각각에 관하여 또는 상기 소정 수 이상의 사용자 중 대표자에 관하여 상기 결제를 진행하는 단계를 포함하는, 능동적 주문 또는 예약 서비스 제공 방법.The step of making a payment associated with the predetermined order or reservation includes the step of making the payment with respect to each of the predetermined number or more of users or a representative of the predetermined number or more of users. .
- 하나 이상의 명령어를 포함하는 컴퓨터 판독 가능 기록 매체로서,A computer readable recording medium comprising one or more instructions,상기 하나 이상의 명령어는, 컴퓨터에 위해 실행되는 경우, 상기 컴퓨터로 하여금, 제1항 내지 제8항 중 어느 한 항의 방법을 수행하게 하는, 컴퓨터 판독 가능 기록 매체.The computer readable recording medium of the one or more instructions, when executed for a computer, causes the computer to perform the method of any one of claims 1 to 8.
- 복수 사용자간 대화 세션에 대한 모니터링 및 그에 기초한 능동적 주문 또는 예약 서비스를 제공하도록 구성된 대화형 AI 에이전트 시스템으로서, An interactive AI agent system configured to provide monitoring of a conversation session between multiple users and an active order or reservation service based thereon,통신 모듈;Communication module;NLU 모듈; 및NLU module; And대화 세션 모니터링/개입 모듈Dialog session monitoring / intervention module을 포함하고, Including,상기 통신 모듈은, 상기 대화 세션 중에 상기 복수 사용자 중 하나 이상의 사용자로부터의 사용자 입력 - 상기 사용자 입력은 상기 복수 사용자 간에 주고받는 대화 입력임 - 을 수신하도록 구성되고,The communication module is configured to receive user input from one or more of the plurality of users during the conversation session, the user input being a conversation input to and from the plurality of users;상기 NLU 모듈은, 소정의 지식베이스에 기초하여, 상기 사용자 입력에 대응하는 소정의 사용자 인텐트를 결정하도록 구성되며,The NLU module is configured to determine a predetermined user intent corresponding to the user input based on a predetermined knowledge base.상기 대화 세션 모니터링/개입 모듈은, The chat session monitoring / intervention module,상기 수신된 사용자 인텐트에 기초하여, 상기 복수 사용자 중 소정 수 이상의 사용자를 위한 소정의 주문 또는 예약을 희망하는 사용자 인텐트가 획득되었는지 판정하고, Based on the received user intent, determine whether a user intent for a predetermined order or reservation for a predetermined number or more of the plurality of users has been obtained,상기 소정 수 이상의 사용자를 위한 상기 소정의 주문 또는 예약을 희망하는 사용자 인텐트가 획득되었다고 판정함에 응답하여, 상기 사용자 인텐트에 부합하는 대화 흐름을 생성하고, In response to determining that a user intent wishing for the predetermined order or reservation for the predetermined number of users has been obtained, generate a conversation flow corresponding to the user intent,상기 소정 수 이상의 사용자를 위한 상기 소정의 주문 또는 예약 이전에 상기 대화형 AI 에이전트 시스템에 의하여 상기 대화 세션 상에서 제공된 주문 또는 예약에 관한 기록을 검색하고,Retrieve a record of an order or reservation provided on the conversation session by the interactive AI agent system prior to the predetermined order or reservation for the predetermined number of users,상기 생성된 대화 흐름 및 상기 검색된 주문 또는 예약에 관한 기록에 기초하여, 상기 소정의 주문 또는 예약을 위한 대화 및/또는 태스크를 수행하도록 구성된, Configured to perform a conversation and / or task for the given order or reservation, based on the generated conversation flow and the record regarding the retrieved order or reservation,대화형 AI 에이전트 시스템.Interactive AI Agent System.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020180016716A KR101924217B1 (en) | 2018-02-12 | 2018-02-12 | Interactive ai agent system and method for actively providing an order or reservation service based on monitoring of a dialogue session among users and using previous history occuring in the dialogue session, computer readable recording medium |
KR10-2018-0016716 | 2018-02-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019156535A1 true WO2019156535A1 (en) | 2019-08-15 |
Family
ID=64561150
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2019/001692 WO2019156535A1 (en) | 2018-02-12 | 2019-02-12 | Interactive ai agent system and method for actively providing order or reservation service on basis of monitoring of dialogue session between users by using previous history information in dialogue session, and computer-readable recording medium |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR101924217B1 (en) |
WO (1) | WO2019156535A1 (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20060117860A (en) * | 2005-05-30 | 2006-11-17 | (주)다음소프트 | Method and system of providing conversational agent service using images |
KR20140122330A (en) * | 2013-04-09 | 2014-10-20 | 주식회사 엘지유플러스 | Method, apparatus, and recording medium for recommendation based on messenger communication |
KR20150035877A (en) * | 2015-02-25 | 2015-04-07 | 네이버 주식회사 | Method, system and recording medium for transaction processing using real time conversation |
KR20170015082A (en) * | 2015-07-29 | 2017-02-08 | 주식회사 엘지씨엔에스 | Ordering method using messenger application, system performing the same and storage media storing the same |
KR20170133031A (en) * | 2016-05-25 | 2017-12-05 | 김신태 | Speech recongnition delivery order and payment system using smart device |
-
2018
- 2018-02-12 KR KR1020180016716A patent/KR101924217B1/en active IP Right Grant
-
2019
- 2019-02-12 WO PCT/KR2019/001692 patent/WO2019156535A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20060117860A (en) * | 2005-05-30 | 2006-11-17 | (주)다음소프트 | Method and system of providing conversational agent service using images |
KR20140122330A (en) * | 2013-04-09 | 2014-10-20 | 주식회사 엘지유플러스 | Method, apparatus, and recording medium for recommendation based on messenger communication |
KR20150035877A (en) * | 2015-02-25 | 2015-04-07 | 네이버 주식회사 | Method, system and recording medium for transaction processing using real time conversation |
KR20170015082A (en) * | 2015-07-29 | 2017-02-08 | 주식회사 엘지씨엔에스 | Ordering method using messenger application, system performing the same and storage media storing the same |
KR20170133031A (en) * | 2016-05-25 | 2017-12-05 | 김신태 | Speech recongnition delivery order and payment system using smart device |
Also Published As
Publication number | Publication date |
---|---|
KR101924217B1 (en) | 2018-11-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105320726B (en) | Reduce the demand to manual beginning/end point and triggering phrase | |
WO2019088384A1 (en) | Method for providing rich-expression natural language conversation by modifying reply, computer device and computer-readable recording medium | |
WO2019132135A1 (en) | Interactive ai agent system and method for actively monitoring and intervening in dialogue session between users, and computer readable recording medium | |
CN111837116B (en) | Method for automatically constructing or updating dialog flow management model of dialog type AI system | |
CN112136175A (en) | Voice interaction for accessing calling functionality of companion device at primary device | |
WO2019156536A1 (en) | Method and computer device for constructing or updating knowledge base model for interactive ai agent system by labeling identifiable, yet non-learnable, data from among learning data, and computer-readable recording medium | |
WO2019147039A1 (en) | Method for determining optimal conversation pattern for goal achievement at particular time point during conversation session associated with conversation understanding ai service system, method for determining goal achievement prediction probability, and computer-readable recording medium | |
KR101891498B1 (en) | Method, computer device and computer readable recording medium for multi domain service resolving the mixture of multi-domain intents in interactive ai agent system | |
WO2019088383A1 (en) | Method and computer device for providing natural language conversation by providing interjection response in timely manner, and computer-readable recording medium | |
WO2019156537A1 (en) | Interactive ai agent system and method for actively providing service related to security and like through dialogue session or separate session on basis of monitoring of dialogue session between users, and computer-readable recording medium | |
WO2019088638A1 (en) | Method, computer device and computer readable recording medium for providing natural language conversation by timely providing substantial reply | |
CN109564580A (en) | Promote effectively search using message exchange thread | |
WO2019143170A1 (en) | Method for generating conversation template for conversation-understanding ai service system having predetermined goal, and computer readable recording medium | |
KR20190103951A (en) | Method, computer device and computer readable recording medium for building or updating knowledgebase models for interactive ai agent systen, by labeling identifiable but not-learnable data in training data set | |
KR102017544B1 (en) | Interactive ai agent system and method for providing seamless chatting service among users using multiple messanger program, computer readable recording medium | |
KR20190094080A (en) | Interactive ai agent system and method for actively providing an order or reservation service based on monitoring of a dialogue session among users, computer readable recording medium | |
CN111429896A (en) | Voice interaction for accessing calling functionality of companion device at primary device | |
WO2019156535A1 (en) | Interactive ai agent system and method for actively providing order or reservation service on basis of monitoring of dialogue session between users by using previous history information in dialogue session, and computer-readable recording medium | |
KR102004092B1 (en) | Method for generating relation database for providing seamless chatting service among users using multiple messanger program, computer readable recording medium | |
KR101934582B1 (en) | Interactive ai agent system and method for providing seamless chatting service among users using multiple messanger program, computer readable recording medium | |
EP4055590A1 (en) | Systems and methods for automating voice commands | |
KR20190103928A (en) | Interactive ai agent system and method for actively providing an order or reservation service based on monitoring of a dialogue session among users and using previous history occuring in the dialogue session, computer readable recording medium | |
WO2019066132A1 (en) | User context-based authentication method having enhanced security, interactive ai agent system, and computer-readable recording medium | |
US12112747B2 (en) | Method for controlling external device based on voice and electronic device thereof | |
KR20190103927A (en) | Interactive ai agent system and method for actively providing a security related service based on monitoring of a dialogue session among users via the dialogue session or a separate session, computer readable recording medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19750285 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19750285 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 08/04/2021) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19750285 Country of ref document: EP Kind code of ref document: A1 |