WO2016175354A1 - Dispositif et procédé de conversation intelligente artificielle - Google Patents
Dispositif et procédé de conversation intelligente artificielle Download PDFInfo
- Publication number
- WO2016175354A1 WO2016175354A1 PCT/KR2015/004347 KR2015004347W WO2016175354A1 WO 2016175354 A1 WO2016175354 A1 WO 2016175354A1 KR 2015004347 W KR2015004347 W KR 2015004347W WO 2016175354 A1 WO2016175354 A1 WO 2016175354A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- response
- voice
- user
- conversation
- question
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Definitions
- the present invention relates to an artificial intelligence dialogue apparatus and method for supporting a dialogue between a person and a robot.
- Chat is a computer or a portable terminal to support the conversation with the other party over the network, it is widely used in the form of chat window messenger.
- chat robot technologies As the necessity increases as a means of communication using natural language between humans and computers (robots) in intelligent agents, various chat robot technologies have been proposed.
- the conventional technology simply answers based on the user's input.
- the passive dialog engine providing only provides a user with not only a lot of heterogeneity, but also requires the user to induce a conversation, the flow of the conversation and the user's interest in the conversation are sharply dropped.
- An object of the present invention is to provide an artificial intelligence device and method for supporting a natural conversation with a user without departing from the present invention.
- an artificial intelligence dialog device may include an input response analysis unit analyzing an input user response, and selecting at least one response scenario among preset scenarios according to an analysis result to respond to a response and a question about a user response. And an output unit for outputting a response control unit for outputting an output command and a silent or conversation start voice, and outputting a response voice and a question voice according to the output command of the reaction control unit.
- the artificial intelligence dialogue apparatus and method according to the present invention actively proceeds a conversation in the order of question transmission, user response reception, and response response to a user response based on a preset scenario, thereby providing only a predetermined answer according to a user input. Rather, it leads to active conversations, thereby minimizing the heterogeneity of the user by conducting conversations with the conversation engine, and enhancing the interest of the conversation.
- FIG. 1 is a block diagram illustrating an artificial intelligence conversation apparatus according to an embodiment of the present invention.
- FIG. 2 is a flowchart illustrating an artificial intelligence conversation method according to an embodiment of the present invention.
- FIG. 1 is a block diagram illustrating an artificial intelligence conversation apparatus according to an embodiment of the present invention.
- Artificial intelligence communication apparatus includes an input unit 100 for receiving a voice from the user utterance, Speech To Text (STT) unit 200 for converting the voice received by the input unit 100 into text, Input response analysis unit 300 for receiving the STT conversion result and analyzing the user response, and at least one response scenario selected from the preset scenarios according to the analysis result, and outputs the response command for the user response and the question And a output unit 500 for outputting a response control unit 400 for transmitting a silent or conversation start voice and outputting a response voice and a question voice according to an output command of the reaction control unit 400.
- STT Speech To Text
- Input unit 100 receives the user's voice through the microphone of the artificial intelligence chat device.
- the artificial intelligence dialogue apparatus performs the output of the question to the user, the input of the answer from the user, the output of the response to the response to the user and the output of the next question according to the response output to the user in order.
- the question according to an embodiment of the present invention is provided at the beginning of a conversation with a user and is expressed by a conversation start voice.
- the conversation start voice is the first question provided through the output unit, or when the silent sound is output, the conversation proceeds in the order of the response voice and the question voice output from the user's first voice input.
- one conversation Supports natural dialogue between the user and the AI dialog within the subject.
- the output unit 500 outputs a conversation start voice, which is a question voice for starting a conversation, based on the application execution environment information before starting the conversation.
- the first embodiment outputs a conversation start voice, which is a question voice, and the conversation proceeds in the order of the user's answer, response voice, and question voice output.
- a conversation start voice which is a question voice
- the conversation proceeds in the order of the user's answer, response voice, and question voice output.
- silence is output
- the voice input of is the starting point of the conversation and the conversation proceeds in the order of response voice and question voice output.
- the application execution environment information may be at least one of a built-in scenario database, a user's personal information, a user's behavior pattern, a record of a previous conversation, and surrounding environment information. If the record of previous conversations is about a company's project, as the application runs, it outputs the question, "How did the project go today?"
- the output unit 500 is a question to start a conversation, not a question related to the company, "good weekend. Is the weather good? ”
- the output unit 500 does not only provide only a predetermined answer based on a user's input, but also provides a user with a question of an appropriate topic as the application is executed, thereby providing a conversation. It's a natural way to start and provide a customized conversation.
- reaction control unit 400 not only commands to select and output a response and a question for a user response from a pre-stored list, but also generates a new response and a question for the user response. It is also possible to print.
- the input unit 100 receives a user's voice input in response to a conversation start voice output or after a silent output, and the STT unit 200 inputs a result of converting the voice of the user into a string.
- the answer analysis unit 300 is provided.
- the input response analysis unit 300 analyzes whether or not the user answer converted into a character string corresponds to a predetermined answer type.
- the structured scenario database stores and manages the selected answers, the general answers, the answers that you want to repeat, and the unrelated answers by their types.
- the selective answer is a type in which the classification according to the answer selection is clearly defined in the user's answer to the question, and examples thereof include positive / negative, spring / summer / fall / winter, and the like.
- the general answer is a type of ambiguity and multiple choices for a question, such as the answer to the question “What kind of exercise do you like?”.
- Repetition is the type that the desired answer corresponds to the answer you want the output to ask again.
- the output unit 500 re-outputs the question immediately output.
- the artificial intelligence dialog device is to modify the scenario based on the user's answer to extract and provide the answers and questions sequentially or to query the user again the question of the category corresponding to the first question It is also possible.
- the input response analysis unit 300 determines which category of a predetermined category the input user's answer belongs to, and displays the result. For example, when it corresponds to the optional answer type, it is determined whether the user response input through sentence analysis is affirmative or negative for the question.
- the reaction controller 400 selects at least one reaction scenario from among scenarios stored in the instrumented scenario database according to the analysis result, and responds based on a reaction scenario according to a subject to which a user answer is applicable. And generate an output command for the question.
- the user can perform a natural conversation with the AI conversation device corresponding to his answer without any dissatisfaction.
- the response control unit 400 transmits a pause command signal to the output unit 500 when new voice data is received from the user during output of the response voice and the question voice of the output unit 500, and then input response analysis unit 300. Resample the responses and questions according to the analysis results according to the new voice data of).
- reaction scenario selected by the reaction controller 400 is determined and modified in real time according to the user's response or the user's comment, and the reaction scenario is appropriately based on the instrumented scenario database. It is corrected.
- the reaction control unit 400 changes the scenario (eg, “Would your friend get married? Where is the marriage ceremony?” Asking the first question on the topic and entering a specific event called wedding attendance). To continue the conversation).
- the output unit 500 outputs text corresponding to the response voice and the question voice through the screen. Accordingly, even in a noisy environment in which the user cannot properly receive a voice from the output unit 500, the user may recognize the response and the question from the text output through the screen, and continue the conversation by uttering the answer. .
- FIG. 2 is a flowchart illustrating an artificial intelligence conversation method according to an embodiment of the present invention.
- an AI conversation method includes outputting a conversation start voice (including silence), receiving a user response (S200), analyzing a user response, and analyzing the result. Selecting at least one reaction scenario from among predetermined scenarios, extracting a response and a question based on the response scenario, and outputting a response voice and a question voice according to the extracted response and question (S400) do.
- step S100 is a step of outputting a question voice or a silence corresponding to a conversation start question to start a conversation, and includes a structured scenario database, a user's personal information, a user's behavior pattern, and a previous conversation.
- the conversation start question is extracted according to the application execution environment information which is at least one of the recording information, and the output information.
- the dialogue is performed in response to the user's answer input by voice in the step S200 and in the order of questions.
- Step S200 converts a user's answer input by voice into text, and provides a sentence for analyzing a user's answer.
- Step S300 performs analysis by determining which type of response type the user answer corresponds to.
- the dialogue is performed in the order of the first question, the user's answer, the response to the user's answer, and the question according to the response (when silence is output, the user's voice input, the response to the user's voice, the question according to the response)
- preset answer types e.g., optional, general, answer you want to repeat, unrelated answer
- the step S400 outputs the text corresponding to the response voice and the question voice through the screen, thereby providing the user with the response text and the question text visually as well as hearing, thereby supporting more accurate recognition of the user. .
- step S600 is a step of determining whether the conversation ends with a criterion.
- the user is confirmed to say goodbye for a predetermined time, if the user does not answer for a predetermined time or longer, the user for a predetermined time or more. If there is no answer, and if the end criterion, such as when there is no user's reply to the voice of the output unit calling the user, the conversation is terminated. From step S200 to step S500 are repeated.
Abstract
La présente invention concerne un dispositif et un procédé de conversation intelligente artificielle qui permettent une conversation entre un humain et un robot. Un dispositif de conversation intelligente artificielle selon un aspect de la présente invention comprend : une unité d'analyse de réponses d'entrée qui analyse une réponse entrée par un utilisateur ; une unité de commande de réponse qui sélectionne au moins un scénario de réponse parmi des scénarios prédéterminés en fonction du résultat de l'analyse et transmet une commande de sortie pour une réponse et une question à la réponse de l'utilisateur ; et une unité de sortie qui délivre un silence ou une voix de début de conversation et délivre une voix de réponse et une voix de question en fonction de la commande de sortie de l'unité de commande de réponse.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/KR2015/004347 WO2016175354A1 (fr) | 2015-04-29 | 2015-04-29 | Dispositif et procédé de conversation intelligente artificielle |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/KR2015/004347 WO2016175354A1 (fr) | 2015-04-29 | 2015-04-29 | Dispositif et procédé de conversation intelligente artificielle |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016175354A1 true WO2016175354A1 (fr) | 2016-11-03 |
Family
ID=57199748
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2015/004347 WO2016175354A1 (fr) | 2015-04-29 | 2015-04-29 | Dispositif et procédé de conversation intelligente artificielle |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2016175354A1 (fr) |
Cited By (92)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019000326A1 (fr) * | 2017-06-29 | 2019-01-03 | Microsoft Technology Licensing, Llc | Génération de réponses dans un service de conversation en ligne automatisé |
CN109582763A (zh) * | 2017-09-27 | 2019-04-05 | 韩国电子通信研究院 | 运动图像专家组媒体物联网环境中的答疑系统及方法 |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US10390213B2 (en) | 2014-09-30 | 2019-08-20 | Apple Inc. | Social reminders |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
US10417344B2 (en) | 2014-05-30 | 2019-09-17 | Apple Inc. | Exemplar-based natural language processing |
US10438595B2 (en) | 2014-09-30 | 2019-10-08 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10496705B1 (en) | 2018-06-03 | 2019-12-03 | Apple Inc. | Accelerated task performance |
US10529332B2 (en) | 2015-03-08 | 2020-01-07 | Apple Inc. | Virtual assistant activation |
US10580409B2 (en) | 2016-06-11 | 2020-03-03 | Apple Inc. | Application integration with a digital assistant |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10657966B2 (en) | 2014-05-30 | 2020-05-19 | Apple Inc. | Better resolution when referencing to concepts |
US10681212B2 (en) | 2015-06-05 | 2020-06-09 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10684703B2 (en) | 2018-06-01 | 2020-06-16 | Apple Inc. | Attention aware virtual assistant dismissal |
US10692504B2 (en) | 2010-02-25 | 2020-06-23 | Apple Inc. | User profiling for voice input processing |
US10699717B2 (en) | 2014-05-30 | 2020-06-30 | Apple Inc. | Intelligent assistant for home automation |
US10714117B2 (en) | 2013-02-07 | 2020-07-14 | Apple Inc. | Voice trigger for a digital assistant |
US10720160B2 (en) | 2018-06-01 | 2020-07-21 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US10741181B2 (en) | 2017-05-09 | 2020-08-11 | Apple Inc. | User interface for correcting recognition errors |
US10741185B2 (en) | 2010-01-18 | 2020-08-11 | Apple Inc. | Intelligent automated assistant |
US10748546B2 (en) | 2017-05-16 | 2020-08-18 | Apple Inc. | Digital assistant services based on device capabilities |
US10769385B2 (en) | 2013-06-09 | 2020-09-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US10878809B2 (en) | 2014-05-30 | 2020-12-29 | Apple Inc. | Multi-command single utterance input method |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US10909171B2 (en) | 2017-05-16 | 2021-02-02 | Apple Inc. | Intelligent automated assistant for media exploration |
US10930282B2 (en) | 2015-03-08 | 2021-02-23 | Apple Inc. | Competing devices responding to voice triggers |
US10942703B2 (en) | 2015-12-23 | 2021-03-09 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US11010127B2 (en) | 2015-06-29 | 2021-05-18 | Apple Inc. | Virtual assistant for media playback |
US11037565B2 (en) | 2016-06-10 | 2021-06-15 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US11048473B2 (en) | 2013-06-09 | 2021-06-29 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US11070949B2 (en) | 2015-05-27 | 2021-07-20 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on an electronic device with a touch-sensitive display |
US11120372B2 (en) | 2011-06-03 | 2021-09-14 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
US11126400B2 (en) | 2015-09-08 | 2021-09-21 | Apple Inc. | Zero latency digital assistant |
US11127397B2 (en) | 2015-05-27 | 2021-09-21 | Apple Inc. | Device voice control |
US11133008B2 (en) | 2014-05-30 | 2021-09-28 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
US11169616B2 (en) | 2018-05-07 | 2021-11-09 | Apple Inc. | Raise to speak |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
CN113707139A (zh) * | 2020-09-02 | 2021-11-26 | 南宁玄鸟网络科技有限公司 | 一种人工智能机器人的语音沟通交流服务系统 |
WO2021261664A1 (fr) * | 2020-06-23 | 2021-12-30 | 주식회사 오투오 | Système de service touristique basé sur un dialogue vocal d'intelligence artificielle |
US11217251B2 (en) | 2019-05-06 | 2022-01-04 | Apple Inc. | Spoken notifications |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US11231904B2 (en) | 2015-03-06 | 2022-01-25 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US11237797B2 (en) | 2019-05-31 | 2022-02-01 | Apple Inc. | User activity shortcut suggestions |
US11269678B2 (en) | 2012-05-15 | 2022-03-08 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11314370B2 (en) | 2013-12-06 | 2022-04-26 | Apple Inc. | Method for extracting salient dialog usage from live data |
US11348582B2 (en) | 2008-10-02 | 2022-05-31 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
US11380310B2 (en) | 2017-05-12 | 2022-07-05 | Apple Inc. | Low-latency intelligent automated assistant |
US11388291B2 (en) | 2013-03-14 | 2022-07-12 | Apple Inc. | System and method for processing voicemail |
US11405466B2 (en) | 2017-05-12 | 2022-08-02 | Apple Inc. | Synchronization and task delegation of a digital assistant |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11423886B2 (en) | 2010-01-18 | 2022-08-23 | Apple Inc. | Task flow identification based on user intent |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11467802B2 (en) | 2017-05-11 | 2022-10-11 | Apple Inc. | Maintaining privacy of personal information |
US11468282B2 (en) | 2015-05-15 | 2022-10-11 | Apple Inc. | Virtual assistant in a communication session |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
US11500672B2 (en) | 2015-09-08 | 2022-11-15 | Apple Inc. | Distributed personal assistant |
US11516537B2 (en) | 2014-06-30 | 2022-11-29 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US11526368B2 (en) | 2015-11-06 | 2022-12-13 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US11532306B2 (en) | 2017-05-16 | 2022-12-20 | Apple Inc. | Detecting a trigger of a digital assistant |
US11580990B2 (en) | 2017-05-12 | 2023-02-14 | Apple Inc. | User-specific acoustic models |
US11599331B2 (en) | 2017-05-11 | 2023-03-07 | Apple Inc. | Maintaining privacy of personal information |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11656884B2 (en) | 2017-01-09 | 2023-05-23 | Apple Inc. | Application integration with a digital assistant |
US11657813B2 (en) | 2019-05-31 | 2023-05-23 | Apple Inc. | Voice identification in digital assistant systems |
US11671920B2 (en) | 2007-04-03 | 2023-06-06 | Apple Inc. | Method and system for operating a multifunction portable electronic device using voice-activation |
US11696060B2 (en) | 2020-07-21 | 2023-07-04 | Apple Inc. | User identification using headphones |
US11710482B2 (en) | 2018-03-26 | 2023-07-25 | Apple Inc. | Natural assistant interaction |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
US11765209B2 (en) | 2020-05-11 | 2023-09-19 | Apple Inc. | Digital assistant hardware abstraction |
US11790914B2 (en) | 2019-06-01 | 2023-10-17 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11798547B2 (en) | 2013-03-15 | 2023-10-24 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US11809483B2 (en) | 2015-09-08 | 2023-11-07 | Apple Inc. | Intelligent automated assistant for media search and playback |
US11809783B2 (en) | 2016-06-11 | 2023-11-07 | Apple Inc. | Intelligent device arbitration and control |
US11810578B2 (en) | 2020-05-11 | 2023-11-07 | Apple Inc. | Device arbitration for digital assistant-based intercom systems |
US11838734B2 (en) | 2020-07-20 | 2023-12-05 | Apple Inc. | Multi-device audio adjustment coordination |
US11853536B2 (en) | 2015-09-08 | 2023-12-26 | Apple Inc. | Intelligent automated assistant in a media environment |
US11854539B2 (en) | 2018-05-07 | 2023-12-26 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US11914848B2 (en) | 2020-05-11 | 2024-02-27 | Apple Inc. | Providing relevant data items based on context |
US11928604B2 (en) | 2005-09-08 | 2024-03-12 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20020010226A (ko) * | 2000-07-28 | 2002-02-04 | 정명수 | 자연어로 입력된 사용자의 질문을 인공지능 시스템이분석하여 인터넷에 존재하는 정보를 효과적으로 제시하는서비스에 대한방법 |
WO2014010879A1 (fr) * | 2012-07-09 | 2014-01-16 | 엘지전자 주식회사 | Appareil et procédé de reconnaissance vocale |
WO2014088377A1 (fr) * | 2012-12-07 | 2014-06-12 | 삼성전자 주식회사 | Dispositif de reconnaissance vocale et son procédé de commande |
US20140222436A1 (en) * | 2013-02-07 | 2014-08-07 | Apple Inc. | Voice trigger for a digital assistant |
-
2015
- 2015-04-29 WO PCT/KR2015/004347 patent/WO2016175354A1/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20020010226A (ko) * | 2000-07-28 | 2002-02-04 | 정명수 | 자연어로 입력된 사용자의 질문을 인공지능 시스템이분석하여 인터넷에 존재하는 정보를 효과적으로 제시하는서비스에 대한방법 |
WO2014010879A1 (fr) * | 2012-07-09 | 2014-01-16 | 엘지전자 주식회사 | Appareil et procédé de reconnaissance vocale |
WO2014088377A1 (fr) * | 2012-12-07 | 2014-06-12 | 삼성전자 주식회사 | Dispositif de reconnaissance vocale et son procédé de commande |
US20140222436A1 (en) * | 2013-02-07 | 2014-08-07 | Apple Inc. | Voice trigger for a digital assistant |
Non-Patent Citations (1)
Title |
---|
KOREA CREATIVE CONTENT AGENCY: "This Month's Issue, Trend and Prospect of Speech Recognition Technology", CULTURE TECHNOLOGY(CT) IN-DEPTH STUDY, November 2011 (2011-11-01), Retrieved from the Internet <URL:https://www.kocca.kr/knowledge/publication/ct/ksFiles/afieldfile/2011/12/07/87NEmyIcVWMc.pdf> * |
Cited By (139)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11928604B2 (en) | 2005-09-08 | 2024-03-12 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US11671920B2 (en) | 2007-04-03 | 2023-06-06 | Apple Inc. | Method and system for operating a multifunction portable electronic device using voice-activation |
US11348582B2 (en) | 2008-10-02 | 2022-05-31 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US11900936B2 (en) | 2008-10-02 | 2024-02-13 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US11423886B2 (en) | 2010-01-18 | 2022-08-23 | Apple Inc. | Task flow identification based on user intent |
US10741185B2 (en) | 2010-01-18 | 2020-08-11 | Apple Inc. | Intelligent automated assistant |
US10692504B2 (en) | 2010-02-25 | 2020-06-23 | Apple Inc. | User profiling for voice input processing |
US11120372B2 (en) | 2011-06-03 | 2021-09-14 | Apple Inc. | Performing actions associated with task items that represent tasks to perform |
US11321116B2 (en) | 2012-05-15 | 2022-05-03 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US11269678B2 (en) | 2012-05-15 | 2022-03-08 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US11636869B2 (en) | 2013-02-07 | 2023-04-25 | Apple Inc. | Voice trigger for a digital assistant |
US10978090B2 (en) | 2013-02-07 | 2021-04-13 | Apple Inc. | Voice trigger for a digital assistant |
US11862186B2 (en) | 2013-02-07 | 2024-01-02 | Apple Inc. | Voice trigger for a digital assistant |
US10714117B2 (en) | 2013-02-07 | 2020-07-14 | Apple Inc. | Voice trigger for a digital assistant |
US11557310B2 (en) | 2013-02-07 | 2023-01-17 | Apple Inc. | Voice trigger for a digital assistant |
US11388291B2 (en) | 2013-03-14 | 2022-07-12 | Apple Inc. | System and method for processing voicemail |
US11798547B2 (en) | 2013-03-15 | 2023-10-24 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US11727219B2 (en) | 2013-06-09 | 2023-08-15 | Apple Inc. | System and method for inferring user intent from speech inputs |
US11048473B2 (en) | 2013-06-09 | 2021-06-29 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10769385B2 (en) | 2013-06-09 | 2020-09-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
US11314370B2 (en) | 2013-12-06 | 2022-04-26 | Apple Inc. | Method for extracting salient dialog usage from live data |
US11133008B2 (en) | 2014-05-30 | 2021-09-28 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US10657966B2 (en) | 2014-05-30 | 2020-05-19 | Apple Inc. | Better resolution when referencing to concepts |
US11670289B2 (en) | 2014-05-30 | 2023-06-06 | Apple Inc. | Multi-command single utterance input method |
US10714095B2 (en) | 2014-05-30 | 2020-07-14 | Apple Inc. | Intelligent assistant for home automation |
US11699448B2 (en) | 2014-05-30 | 2023-07-11 | Apple Inc. | Intelligent assistant for home automation |
US10878809B2 (en) | 2014-05-30 | 2020-12-29 | Apple Inc. | Multi-command single utterance input method |
US10699717B2 (en) | 2014-05-30 | 2020-06-30 | Apple Inc. | Intelligent assistant for home automation |
US10417344B2 (en) | 2014-05-30 | 2019-09-17 | Apple Inc. | Exemplar-based natural language processing |
US11257504B2 (en) | 2014-05-30 | 2022-02-22 | Apple Inc. | Intelligent assistant for home automation |
US11810562B2 (en) | 2014-05-30 | 2023-11-07 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US11516537B2 (en) | 2014-06-30 | 2022-11-29 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US11838579B2 (en) | 2014-06-30 | 2023-12-05 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10390213B2 (en) | 2014-09-30 | 2019-08-20 | Apple Inc. | Social reminders |
US10438595B2 (en) | 2014-09-30 | 2019-10-08 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US11231904B2 (en) | 2015-03-06 | 2022-01-25 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US11087759B2 (en) | 2015-03-08 | 2021-08-10 | Apple Inc. | Virtual assistant activation |
US10930282B2 (en) | 2015-03-08 | 2021-02-23 | Apple Inc. | Competing devices responding to voice triggers |
US10529332B2 (en) | 2015-03-08 | 2020-01-07 | Apple Inc. | Virtual assistant activation |
US11842734B2 (en) | 2015-03-08 | 2023-12-12 | Apple Inc. | Virtual assistant activation |
US11468282B2 (en) | 2015-05-15 | 2022-10-11 | Apple Inc. | Virtual assistant in a communication session |
US11070949B2 (en) | 2015-05-27 | 2021-07-20 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on an electronic device with a touch-sensitive display |
US11127397B2 (en) | 2015-05-27 | 2021-09-21 | Apple Inc. | Device voice control |
US10681212B2 (en) | 2015-06-05 | 2020-06-09 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US11947873B2 (en) | 2015-06-29 | 2024-04-02 | Apple Inc. | Virtual assistant for media playback |
US11010127B2 (en) | 2015-06-29 | 2021-05-18 | Apple Inc. | Virtual assistant for media playback |
US11853536B2 (en) | 2015-09-08 | 2023-12-26 | Apple Inc. | Intelligent automated assistant in a media environment |
US11809483B2 (en) | 2015-09-08 | 2023-11-07 | Apple Inc. | Intelligent automated assistant for media search and playback |
US11954405B2 (en) | 2015-09-08 | 2024-04-09 | Apple Inc. | Zero latency digital assistant |
US11550542B2 (en) | 2015-09-08 | 2023-01-10 | Apple Inc. | Zero latency digital assistant |
US11500672B2 (en) | 2015-09-08 | 2022-11-15 | Apple Inc. | Distributed personal assistant |
US11126400B2 (en) | 2015-09-08 | 2021-09-21 | Apple Inc. | Zero latency digital assistant |
US11526368B2 (en) | 2015-11-06 | 2022-12-13 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US11809886B2 (en) | 2015-11-06 | 2023-11-07 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US11886805B2 (en) | 2015-11-09 | 2024-01-30 | Apple Inc. | Unconventional virtual assistant interactions |
US10942703B2 (en) | 2015-12-23 | 2021-03-09 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US11657820B2 (en) | 2016-06-10 | 2023-05-23 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US11037565B2 (en) | 2016-06-10 | 2021-06-15 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US11152002B2 (en) | 2016-06-11 | 2021-10-19 | Apple Inc. | Application integration with a digital assistant |
US11809783B2 (en) | 2016-06-11 | 2023-11-07 | Apple Inc. | Intelligent device arbitration and control |
US10580409B2 (en) | 2016-06-11 | 2020-03-03 | Apple Inc. | Application integration with a digital assistant |
US11749275B2 (en) | 2016-06-11 | 2023-09-05 | Apple Inc. | Application integration with a digital assistant |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US11656884B2 (en) | 2017-01-09 | 2023-05-23 | Apple Inc. | Application integration with a digital assistant |
US10741181B2 (en) | 2017-05-09 | 2020-08-11 | Apple Inc. | User interface for correcting recognition errors |
US11467802B2 (en) | 2017-05-11 | 2022-10-11 | Apple Inc. | Maintaining privacy of personal information |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
US11599331B2 (en) | 2017-05-11 | 2023-03-07 | Apple Inc. | Maintaining privacy of personal information |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
US11862151B2 (en) | 2017-05-12 | 2024-01-02 | Apple Inc. | Low-latency intelligent automated assistant |
US11405466B2 (en) | 2017-05-12 | 2022-08-02 | Apple Inc. | Synchronization and task delegation of a digital assistant |
US11538469B2 (en) | 2017-05-12 | 2022-12-27 | Apple Inc. | Low-latency intelligent automated assistant |
US11837237B2 (en) | 2017-05-12 | 2023-12-05 | Apple Inc. | User-specific acoustic models |
US11380310B2 (en) | 2017-05-12 | 2022-07-05 | Apple Inc. | Low-latency intelligent automated assistant |
US11580990B2 (en) | 2017-05-12 | 2023-02-14 | Apple Inc. | User-specific acoustic models |
US10909171B2 (en) | 2017-05-16 | 2021-02-02 | Apple Inc. | Intelligent automated assistant for media exploration |
US11675829B2 (en) | 2017-05-16 | 2023-06-13 | Apple Inc. | Intelligent automated assistant for media exploration |
US10748546B2 (en) | 2017-05-16 | 2020-08-18 | Apple Inc. | Digital assistant services based on device capabilities |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US11532306B2 (en) | 2017-05-16 | 2022-12-20 | Apple Inc. | Detecting a trigger of a digital assistant |
WO2019000326A1 (fr) * | 2017-06-29 | 2019-01-03 | Microsoft Technology Licensing, Llc | Génération de réponses dans un service de conversation en ligne automatisé |
CN109582763B (zh) * | 2017-09-27 | 2023-08-22 | 韩国电子通信研究院 | 运动图像专家组媒体物联网环境中的答疑系统及方法 |
CN109582763A (zh) * | 2017-09-27 | 2019-04-05 | 韩国电子通信研究院 | 运动图像专家组媒体物联网环境中的答疑系统及方法 |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US11710482B2 (en) | 2018-03-26 | 2023-07-25 | Apple Inc. | Natural assistant interaction |
US11900923B2 (en) | 2018-05-07 | 2024-02-13 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US11169616B2 (en) | 2018-05-07 | 2021-11-09 | Apple Inc. | Raise to speak |
US11907436B2 (en) | 2018-05-07 | 2024-02-20 | Apple Inc. | Raise to speak |
US11487364B2 (en) | 2018-05-07 | 2022-11-01 | Apple Inc. | Raise to speak |
US11854539B2 (en) | 2018-05-07 | 2023-12-26 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10984798B2 (en) | 2018-06-01 | 2021-04-20 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US11431642B2 (en) | 2018-06-01 | 2022-08-30 | Apple Inc. | Variable latency device coordination |
US11630525B2 (en) | 2018-06-01 | 2023-04-18 | Apple Inc. | Attention aware virtual assistant dismissal |
US10684703B2 (en) | 2018-06-01 | 2020-06-16 | Apple Inc. | Attention aware virtual assistant dismissal |
US10720160B2 (en) | 2018-06-01 | 2020-07-21 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US11009970B2 (en) | 2018-06-01 | 2021-05-18 | Apple Inc. | Attention aware virtual assistant dismissal |
US11360577B2 (en) | 2018-06-01 | 2022-06-14 | Apple Inc. | Attention aware virtual assistant dismissal |
US10496705B1 (en) | 2018-06-03 | 2019-12-03 | Apple Inc. | Accelerated task performance |
US10504518B1 (en) | 2018-06-03 | 2019-12-10 | Apple Inc. | Accelerated task performance |
US10944859B2 (en) | 2018-06-03 | 2021-03-09 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11893992B2 (en) | 2018-09-28 | 2024-02-06 | Apple Inc. | Multi-modal inputs for voice commands |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11783815B2 (en) | 2019-03-18 | 2023-10-10 | Apple Inc. | Multimodality in digital assistant systems |
US11217251B2 (en) | 2019-05-06 | 2022-01-04 | Apple Inc. | Spoken notifications |
US11705130B2 (en) | 2019-05-06 | 2023-07-18 | Apple Inc. | Spoken notifications |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11675491B2 (en) | 2019-05-06 | 2023-06-13 | Apple Inc. | User configurable task triggers |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11888791B2 (en) | 2019-05-21 | 2024-01-30 | Apple Inc. | Providing message response suggestions |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
US11657813B2 (en) | 2019-05-31 | 2023-05-23 | Apple Inc. | Voice identification in digital assistant systems |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
US11237797B2 (en) | 2019-05-31 | 2022-02-01 | Apple Inc. | User activity shortcut suggestions |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
US11360739B2 (en) | 2019-05-31 | 2022-06-14 | Apple Inc. | User activity shortcut suggestions |
US11790914B2 (en) | 2019-06-01 | 2023-10-17 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
US11765209B2 (en) | 2020-05-11 | 2023-09-19 | Apple Inc. | Digital assistant hardware abstraction |
US11810578B2 (en) | 2020-05-11 | 2023-11-07 | Apple Inc. | Device arbitration for digital assistant-based intercom systems |
US11914848B2 (en) | 2020-05-11 | 2024-02-27 | Apple Inc. | Providing relevant data items based on context |
US11924254B2 (en) | 2020-05-11 | 2024-03-05 | Apple Inc. | Digital assistant hardware abstraction |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
WO2021261664A1 (fr) * | 2020-06-23 | 2021-12-30 | 주식회사 오투오 | Système de service touristique basé sur un dialogue vocal d'intelligence artificielle |
US11838734B2 (en) | 2020-07-20 | 2023-12-05 | Apple Inc. | Multi-device audio adjustment coordination |
US11696060B2 (en) | 2020-07-21 | 2023-07-04 | Apple Inc. | User identification using headphones |
US11750962B2 (en) | 2020-07-21 | 2023-09-05 | Apple Inc. | User identification using headphones |
CN113707139A (zh) * | 2020-09-02 | 2021-11-26 | 南宁玄鸟网络科技有限公司 | 一种人工智能机器人的语音沟通交流服务系统 |
CN113707139B (zh) * | 2020-09-02 | 2024-04-09 | 南宁玄鸟网络科技有限公司 | 一种人工智能机器人的语音沟通交流服务系统 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2016175354A1 (fr) | Dispositif et procédé de conversation intelligente artificielle | |
WO2021051506A1 (fr) | Procédé et appareil d'interaction vocale, dispositif informatique et support de stockage | |
KR20190095181A (ko) | 인공 지능을 이용한 화상 회의 시스템 | |
WO2011074771A2 (fr) | Appareil et procédé permettant l'étude d'une langue étrangère | |
US20140214426A1 (en) | System and method for improving voice communication over a network | |
CN109961792A (zh) | 用于识别语音的方法和装置 | |
CN110047481A (zh) | 用于语音识别的方法和装置 | |
JP2012530954A (ja) | 言語コミュニケーションを改善する方法と装置 | |
CN109712610A (zh) | 用于识别语音的方法和装置 | |
JP6689953B2 (ja) | 通訳サービスシステム、通訳サービス方法及び通訳サービスプログラム | |
CN111063346A (zh) | 基于机器学习的跨媒体明星情感陪伴交互系统 | |
KR20080114100A (ko) | 컴퓨터 주도형 대화 장치 및 방법 | |
WO2019142976A1 (fr) | Procédé de commande d'affichage, support d'enregistrement lisible par ordinateur, et dispositif informatique pour afficher une réponse de conversation candidate pour une entrée de parole d'utilisateur | |
CN113630309B (zh) | 机器人会话系统、方法、装置、计算机设备和存储介质 | |
KR20120073557A (ko) | 음성인식기반 지능형 로봇 시스템 | |
KR100677435B1 (ko) | 대화형 어학 학습 시스템 및 방법 | |
KR20220140301A (ko) | 인공지능을 통해 학습자 식별이 가능한 화상 학습 시스템 및 그 방법 | |
KR20220140304A (ko) | 학습자의 음성 명령을 인식하는 화상 학습 시스템 및 그 방법 | |
CN110675856A (zh) | 用于呼叫中心的人机对话方法及装置 | |
JP2018055155A (ja) | 音声対話装置および音声対話方法 | |
CN112309183A (zh) | 适用于外语教学的交互式听说练习系统 | |
CN110519470A (zh) | 一种语音处理方法、服务器和语音接入装置 | |
KR102577643B1 (ko) | 온라인 일대일 한국어 강의 플랫폼 시스템 및 이에 포함된 운영 서버 | |
KR102364935B1 (ko) | 5g 기반의 음성인식 반응속도 개선을 위한 데이터 전송 방법 및 장치 | |
JP6349149B2 (ja) | レッスン進行システム、レッスン進行方法およびレッスン進行プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15890795 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14/02/2018) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15890795 Country of ref document: EP Kind code of ref document: A1 |