JPWO2021183681A5 - Google Patents

Publication number
JPWO2021183681A5
Authority
JP
Japan
Prior art keywords
user
interpretations
primary
results
user input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2022555120A
Other languages
Japanese (ja)
Other versions
JP2023518026A (en)
Publication date
Application filed
Priority claimed from PCT/US2021/021767 (WO2021183681A1)
Publication of JP2023518026A
Publication of JPWO2021183681A5
Legal status: Pending (current)

Claims (19)

1. A method for providing virtual assistance, the method comprising:
receiving a user input comprising a user request for an action or information;
generating two or more primary interpretations of the user input by processing the user input, the two or more primary interpretations comprising unique possible transcriptions of the user input;
generating one or more secondary interpretations for one or more of the two or more primary interpretations by processing one or more of the primary interpretations to form alternative interpretations;
determining one or more primary actions in response to the two or more primary interpretations and the one or more secondary interpretations;
preparing one or more results from executing the one or more primary actions;
determining whether one or more secondary actions exist in response to at least one of the one or more primary actions;
if the one or more secondary actions exist, continuing to process two or more of the primary interpretations, the one or more secondary interpretations, the one or more primary actions, and the one or more secondary actions until no further additional actions can be predicted;
upon one or more additional actions not existing, designating the one or more results for which no additional actions are predicted as one or more final results;
scoring the one or more final results;
designating the final result with the highest score as a top result; and
outputting at least the top result to the user, or taking an action defined by the top result.
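The control flow recited in claim 1 can be sketched in Python as follows. This is an illustrative sketch only, not the applicant's implementation: the `Hypothesis` type and the callables `transcribe`, `reinterpret`, `next_actions`, and `score` are hypothetical placeholders for components the claim leaves unspecified, and the sketch assumes that `next_actions` eventually returns an empty list for every hypothesis.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Hypothesis:
    """One candidate reading of the user input plus the actions taken on it."""
    interpretation: str
    actions: List[str] = field(default_factory=list)


def top_result(
    user_input: str,
    transcribe: Callable[[str], List[str]],           # hypothetical: yields primary interpretations
    reinterpret: Callable[[str], List[str]],          # hypothetical: yields secondary interpretations
    next_actions: Callable[[Hypothesis], List[str]],  # hypothetical: predicts further actions
    score: Callable[[Hypothesis], float],             # hypothetical: scores a final result
) -> Hypothesis:
    # Primary interpretations: distinct possible transcriptions of the input.
    primary = transcribe(user_input)
    # Secondary interpretations: alternatives derived from the primary ones.
    secondary = [alt for p in primary for alt in reinterpret(p)]
    frontier = [Hypothesis(text) for text in primary + secondary]

    final: List[Hypothesis] = []
    # Keep expanding until no further action can be predicted for any branch.
    while frontier:
        hyp = frontier.pop()
        follow_ups = next_actions(hyp)
        if not follow_ups:
            # No additional action predicted: this branch is a final result.
            final.append(hyp)
            continue
        for action in follow_ups:
            frontier.append(Hypothesis(hyp.interpretation, hyp.actions + [action]))

    # The highest-scoring final result is the top result.
    # Raises ValueError if transcribe() produced no interpretations at all.
    return max(final, key=score)
```

Each branch is explored independently, so a real system could evaluate the branches concurrently; the sequential loop here is chosen only to keep the sketch short.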
2. The method of claim 1, wherein the scoring is based on one or more of the following factors:
a first scoring factor based on a conversation state, the conversation state comprising the two or more primary interpretations, the one or more secondary interpretations, the one or more actions, and the one or more results;
a second scoring factor based on a user profile, the user profile comprising user preferences and user history stored on one or more servers; and
a third scoring factor based on auxiliary metadata, the auxiliary metadata comprising data stored on the one or more servers that is not related to user preferences and not related to user history.
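One way to combine the three scoring factors of claim 2 is a weighted average over whichever factors are available. The claim does not say how the factors are combined, so the linear weighting, the default weights, and the names `ScoringFactors` and `combined_score` below are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ScoringFactors:
    """Per-result factor values; None means the factor is not used."""
    conversation_state: Optional[float] = None   # first factor
    user_profile: Optional[float] = None         # second factor
    auxiliary_metadata: Optional[float] = None   # third factor


def combined_score(
    factors: ScoringFactors,
    weights: Tuple[float, float, float] = (0.5, 0.3, 0.2),  # assumed weights
) -> float:
    values = (
        factors.conversation_state,
        factors.user_profile,
        factors.auxiliary_metadata,
    )
    # Keep only the factors that are present ("one or more" in the claim).
    used = [(v, w) for v, w in zip(values, weights) if v is not None]
    if not used:
        raise ValueError("at least one scoring factor is required")
    # Renormalize so that omitting a factor does not shrink the score.
    total_weight = sum(w for _, w in used)
    return sum(v * w for v, w in used) / total_weight
```

Renormalizing by the weights actually used keeps scores comparable between results that were scored with different subsets of factors.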
3. The method of claim 1 or 2, wherein the user input is speech spoken by the user.
4. The method of any one of claims 1 to 3, wherein generating the two or more primary interpretations of the user input is performed simultaneously in parallel.
5. The method of any one of claims 1 to 4, further comprising requesting the user to clarify which of the two or more primary interpretations or the one or more secondary interpretations is correct.
6. The method of any one of claims 1 to 5, wherein the method is performed by an artificial intelligence layer running on an operating system of a user device.
7. The method of any one of claims 1 to 6, wherein outputting at least the top result to the user, or taking an action defined by the top result, comprises one or more of: playing a song, initiating a telephone call, providing information to the user, playing a video, sending a text message, recording a video, transmitting information from a user device, and controlling lighting.
8. A virtual assistant system comprising:
a user interface configured to receive input from a user and provide a response to the user;
a processor configured to execute machine-executable code; and
a memory storing non-transitory machine-executable code, the machine-executable code being configured to:
process the user input to generate two or more primary interpretations, the two or more primary interpretations comprising unique possible transcriptions of the user input;
generate one or more secondary interpretations based on one or more of the two or more primary interpretations by processing one or more of the primary interpretations to form alternative interpretations;
process the primary interpretations and the alternative interpretations to generate results that lead to two or more final states;
score the two or more final states and rank them such that the highest-ranked final state is a top result; and
present the top result to the user, or execute the top result for the user.
9. The system of claim 8, wherein the user interface comprises a microphone and a speaker.
10. The system of claim 8 or 9, further comprising a transceiver configured to communicate over a network with a second device, the second device being configured to execute second virtual assistant machine-executable code to assist the virtual assistant system in generating the top result for the user.
11. The system of any one of claims 8 to 10, wherein the virtual assistant system is a smartphone.
12. The system of any one of claims 8 to 11, wherein a plurality of final states are presented to the user for review and selection by the user.
13. The system of any one of claims 8 to 12, wherein executing the top result comprises one of the following actions: displaying text, displaying an image, playing music, playing a video, performing a transaction, and turning a device on or off.
14. The system of any one of claims 8 to 13, wherein the machine-executable code is further configured to:
present feedback to the user requesting additional information regarding one or more of the primary interpretations, the alternative interpretations, the results, and the final states; and
in response to receiving the additional information from the user, process the additional information to generate additional alternative interpretations or re-score the two or more final states.
15. A method for providing virtual assistance, the method comprising:
receiving a user input comprising a request for an action or information;
generating two or more interpretations of the user input by processing the user input, the two or more interpretations comprising unique possible transcriptions of the user input;
matching at least one of the two or more interpretations to one or more primary agents based on the one or more primary agents being configured to process the at least one interpretation;
selecting, by the one or more primary agents, one or more skills configured to process at least one of the two or more interpretations;
generating one or more results by processing the at least one of the two or more interpretations with the one or more skills;
determining whether one or more secondary agents can be matched to the one or more results for further processing of the results by one or more of the secondary agents;
if one or more secondary agents are matched, continuing to process the one or more results to generate additional results;
designating at least one of the one or more results and at least one of the additional results as two or more final results;
scoring the two or more final results;
designating the final result with the highest score as a top result; and
outputting at least the top result to the user, or taking an action defined by the top result.
16. The method of claim 15, wherein an agent is a software module or routine executable to perform parallel hypothetical reasoning.
17. The method of claim 15 or 16, wherein a skill is a software module or routine executable to perform a task or generate a result in response to a single user query.
18. The method of any one of claims 15 to 17, further comprising generating one or more secondary interpretations for at least one of the primary interpretations.
19. The method of any one of claims 15 to 18, wherein receiving the user input comprises receiving speech from the user and converting the speech into a digital signal.
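The agent and skill matching recited in claim 15 can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the claimed implementation: the `Skill` and `Agent` classes, the `can_handle` predicate, and the `run_agents` function are hypothetical, interpretations and results are modeled as plain strings, and the secondary agents are applied in a single pass rather than repeatedly.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Skill:
    """Hypothetical skill: handles one kind of query and produces a result."""
    name: str
    can_handle: Callable[[str], bool]
    run: Callable[[str], str]


@dataclass
class Agent:
    """Hypothetical agent: owns skills and picks one that fits the input."""
    name: str
    skills: List[Skill]

    def select_skill(self, text: str) -> Optional[Skill]:
        return next((s for s in self.skills if s.can_handle(text)), None)


def run_agents(
    interpretations: List[str],
    primary_agents: List[Agent],
    secondary_agents: List[Agent],
    score: Callable[[str], float],  # hypothetical scoring function
) -> Optional[str]:
    # Match interpretations to primary agents and run the selected skills.
    results = []
    for text in interpretations:
        for agent in primary_agents:
            skill = agent.select_skill(text)
            if skill is not None:
                results.append(skill.run(text))

    # Check whether any secondary agent matches a result; if so, process further.
    additional = []
    for result in results:
        for agent in secondary_agents:
            skill = agent.select_skill(result)
            if skill is not None:
                additional.append(skill.run(result))

    # Results and additional results together form the final results.
    final = results + additional
    # Return the highest-scoring final result, or None if nothing matched.
    return max(final, key=score, default=None)
```

Interpretations that no primary agent can handle simply produce no result, which is how implausible transcriptions drop out before scoring in this sketch.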
JP2022555120A 2020-03-10 2021-03-10 Parallel Hypothesis Inference for Enhancing Multilingual, Multiturn, and Multidomain Virtual Assistants Pending JP2023518026A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202062987814P 2020-03-10 2020-03-10
US62/987,814 2020-03-10
PCT/US2021/021767 WO2021183681A1 (en) 2020-03-10 2021-03-10 Parallel hypothetical reasoning to power a multi-lingual, multi-turn, multi-domain virtual assistant

Publications (2)

Publication Number Publication Date
JP2023518026A JP2023518026A (en) 2023-04-27
JPWO2021183681A5 JPWO2021183681A5 (en) 2024-03-11

Family

ID=77665251

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022555120A Pending JP2023518026A (en) 2020-03-10 2021-03-10 Parallel Hypothesis Inference for Enhancing Multilingual, Multiturn, and Multidomain Virtual Assistants

Country Status (6)

Country Link
US (1) US11869497B2 (en)
EP (1) EP4118538A4 (en)
JP (1) JP2023518026A (en)
KR (1) KR20230010624A (en)
CN (1) CN115668206A (en)
WO (1) WO2021183681A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11763259B1 (en) 2020-02-20 2023-09-19 Asana, Inc. Systems and methods to generate units of work in a collaboration environment
US11900323B1 (en) * 2020-06-29 2024-02-13 Asana, Inc. Systems and methods to generate units of work within a collaboration environment based on video dictation
US11809222B1 (en) 2021-05-24 2023-11-07 Asana, Inc. Systems and methods to generate units of work within a collaboration environment based on selection of text
US11836681B1 (en) 2022-02-17 2023-12-05 Asana, Inc. Systems and methods to generate records within a collaboration environment
US11997425B1 (en) 2022-02-17 2024-05-28 Asana, Inc. Systems and methods to generate correspondences between portions of recorded audio content and records of a collaboration environment

Family Cites Families (98)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020032591A1 (en) 2000-09-08 2002-03-14 Agentai, Inc. Service request processing performed by artificial intelligence systems in conjunctiion with human intervention
KR20020030545A (en) 2000-10-18 2002-04-25 남 데이비드 이 Automatic answer and search method - based on artificial intelligence and natural languane process technology - for natural and sentencial questions.
US6996064B2 (en) 2000-12-21 2006-02-07 International Business Machines Corporation System and method for determining network throughput speed and streaming utilization
JP2007512860A (en) 2003-11-04 2007-05-24 クアンタム・インテック・インコーポレーテッド Systems and methods for promoting physiological harmony using respiratory training
US7831564B1 (en) 2003-12-16 2010-11-09 Symantec Operating Corporation Method and system of generating a point-in-time image of at least a portion of a database
US7620549B2 (en) * 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US20070043736A1 (en) 2005-08-22 2007-02-22 Microsoft Corporation Smart find
KR100657331B1 (en) 2005-08-24 2006-12-14 삼성전자주식회사 Apparaus and method for forming image using multi-processor
US8335767B2 (en) 2007-10-17 2012-12-18 Oracle International Corporation Maintaining and utilizing SQL execution plan histories
KR20100035391A (en) 2008-09-26 2010-04-05 웅진코웨이주식회사 Valve module for changing flow paths and soft water apparatu
KR101042515B1 (en) 2008-12-11 2011-06-17 주식회사 네오패드 Method for searching information based on user's intention and method for providing information
US20100205222A1 (en) 2009-02-10 2010-08-12 Tom Gajdos Music profiling
US8326637B2 (en) * 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
TWI432347B (en) 2011-03-11 2014-04-01 Wistron Corp Holder device which could adjust positions automatically, and the combination of the holder device and the electronic device
US8954431B2 (en) 2011-06-09 2015-02-10 Xerox Corporation Smart collaborative brainstorming tool
US9009041B2 (en) * 2011-07-26 2015-04-14 Nuance Communications, Inc. Systems and methods for improving the accuracy of a transcription using auxiliary data such as personal data
US8762156B2 (en) * 2011-09-28 2014-06-24 Apple Inc. Speech recognition repair using contextual information
US9542956B1 (en) 2012-01-09 2017-01-10 Interactive Voice, Inc. Systems and methods for responding to human spoken audio
US9280610B2 (en) * 2012-05-14 2016-03-08 Apple Inc. Crowd sourcing information to fulfill user requests
KR101399472B1 (en) 2012-08-13 2014-06-27 (주)투비소프트 Method and apparatus for rendering processing by using multiple processings
WO2014040175A1 (en) 2012-09-14 2014-03-20 Interaxon Inc. Systems and methods for collecting, analyzing, and sharing bio-signal and non-bio-signal data
KR20140078169A (en) 2012-12-17 2014-06-25 삼성전자주식회사 Imaging apparatus, magnetic resonance imaging and method for controlling the imaging apparatus or the magnetic resonance imaging apparatus
WO2014107795A1 (en) 2013-01-08 2014-07-17 Interaxon Inc. Adaptive brain training computer system and method
CN113470640B (en) * 2013-02-07 2022-04-26 苹果公司 Voice trigger of digital assistant
US9172747B2 (en) 2013-02-25 2015-10-27 Artificial Solutions Iberia SL System and methods for virtual assistant networks
KR102014665B1 (en) 2013-03-15 2019-08-26 애플 인크. User training by intelligent digital assistant
US9058805B2 (en) * 2013-05-13 2015-06-16 Google Inc. Multiple recognizer speech recognition
US10390732B2 (en) 2013-08-14 2019-08-27 Digital Ally, Inc. Breath analyzer, system, and computer program for authenticating, preserving, and presenting breath analysis data
WO2015057586A1 (en) 2013-10-14 2015-04-23 Yahoo! Inc. Systems and methods for providing context-based user interface
US9721570B1 (en) * 2013-12-17 2017-08-01 Amazon Technologies, Inc. Outcome-oriented dialogs on a speech recognition platform
TWM483638U (en) 2014-03-31 2014-08-01 Taer Innovation Co Ltd Stand
US20150288857A1 (en) 2014-04-07 2015-10-08 Microsoft Corporation Mount that facilitates positioning and orienting a mobile computing device
US9607102B2 (en) * 2014-09-05 2017-03-28 Nuance Communications, Inc. Task switching in dialogue processing
US10402460B1 (en) 2014-09-08 2019-09-03 Amazon Technologies, Inc. Contextual card generation and delivery
US9774682B2 (en) 2015-01-08 2017-09-26 International Business Machines Corporation Parallel data streaming between cloud-based applications and massively parallel systems
US10756963B2 (en) 2015-03-17 2020-08-25 Pulzze Systems, Inc. System and method for developing run time self-modifying interaction solution through configuration
US10395021B2 (en) 2015-06-29 2019-08-27 Mesh Candy, Inc. Security and identification system and method using data collection and messaging over a dynamic mesh network with multiple protocols
US10582011B2 (en) 2015-08-06 2020-03-03 Samsung Electronics Co., Ltd. Application cards based on contextual data
US11587559B2 (en) 2015-09-30 2023-02-21 Apple Inc. Intelligent device identification
US10249207B2 (en) 2016-01-19 2019-04-02 TheBeamer, LLC Educational teaching system and method utilizing interactive avatars with learning manager and authoring manager functions
US10872306B2 (en) 2016-04-06 2020-12-22 Smiota, Inc. Facilitating retrieval of items from an electronic device
KR102656806B1 (en) 2016-04-28 2024-04-12 엘지전자 주식회사 Watch type terminal and method of contolling the same
US10631743B2 (en) 2016-05-23 2020-04-28 The Staywell Company, Llc Virtual reality guided meditation with biofeedback
US10156775B2 (en) 2016-06-01 2018-12-18 Eric Zimmermann Extensible mobile recording device holder
DK179309B1 (en) * 2016-06-09 2018-04-23 Apple Inc Intelligent automated assistant in a home environment
US20170357910A1 (en) 2016-06-10 2017-12-14 Apple Inc. System for iteratively training an artificial intelligence using cloud-based metrics
WO2017222503A1 (en) 2016-06-21 2017-12-28 Hewlett-Packard Development Company, L.P. Communications utilizing multiple virtual assistant services
US10244122B2 (en) 2016-07-21 2019-03-26 Vivint, Inc. Panel control over broadband
WO2018022085A1 (en) 2016-07-29 2018-02-01 Hewlett-Packard Development Company, L.P. Identification of preferred communication devices
US9654598B1 (en) 2016-08-08 2017-05-16 Le Technology, Inc. User customization of cards
US20180054228A1 (en) 2016-08-16 2018-02-22 I-Tan Lin Teleoperated electronic device holder
US10798548B2 (en) 2016-08-22 2020-10-06 Lg Electronics Inc. Method for controlling device by using Bluetooth technology, and apparatus
US10423685B2 (en) 2016-09-09 2019-09-24 Robert Bosch Gmbh System and method for automatic question generation from knowledge base
US9959861B2 (en) * 2016-09-30 2018-05-01 Robert Bosch Gmbh System and method for speech recognition
US10855714B2 (en) 2016-10-31 2020-12-01 KnowBe4, Inc. Systems and methods for an artificial intelligence driven agent
US11429586B2 (en) 2016-12-22 2022-08-30 Sap Se Expression update validation
US10365932B2 (en) 2017-01-23 2019-07-30 Essential Products, Inc. Dynamic application customization for automated environments
US20180232920A1 (en) 2017-02-10 2018-08-16 Microsoft Technology Licensing, Llc Contextually aware location selections for teleconference monitor views
KR102384641B1 (en) 2017-02-20 2022-04-08 엘지전자 주식회사 Method for controlling an intelligent system that performs multilingual processing
DK3628101T3 (en) 2017-04-28 2023-09-18 Better Therapeutics Inc METHOD AND SYSTEM FOR ADMINISTRATION OF LIFESTYLE AND HEALTH INTERVENTIONS
DK201770428A1 (en) * 2017-05-12 2019-02-18 Apple Inc. Low-latency intelligent automated assistant
US10554595B2 (en) 2017-05-22 2020-02-04 Genesys Telecommunications Laboratories, Inc. Contact center system and method for advanced outbound communications to a contact group
CN107423364B (en) 2017-06-22 2024-01-26 百度在线网络技术(北京)有限公司 Method, device and storage medium for answering operation broadcasting based on artificial intelligence
EP3435642A1 (en) 2017-07-29 2019-01-30 Advanced Digital Broadcast S.A. A system and method for remote control of appliances by voice
US20190122121A1 (en) 2017-10-23 2019-04-25 AISA Innotech Inc. Method and system for generating individual microdata
US11227448B2 (en) 2017-11-14 2022-01-18 Nvidia Corporation Cloud-centric platform for collaboration and connectivity on 3D virtual environments
US11295735B1 (en) 2017-12-13 2022-04-05 Amazon Technologies, Inc. Customizing voice-control for developer devices
US11250336B2 (en) 2017-12-28 2022-02-15 Intel Corporation Distributed and contextualized artificial intelligence inference service
US10963499B2 (en) 2017-12-29 2021-03-30 Aiqudo, Inc. Generating command-specific language model discourses for digital assistant interpretation
US10729399B2 (en) 2018-03-01 2020-08-04 KUB Technologies, Inc. System and method for cabinet X-ray system with camera and X-ray images superimposition
EP3559946B1 (en) 2018-03-07 2020-09-23 Google LLC Facilitating end-to-end communications with automated assistants in multiple languages
KR102508677B1 (en) * 2018-03-08 2023-03-13 삼성전자주식회사 System for processing user utterance and controlling method thereof
US11037545B2 (en) 2018-03-19 2021-06-15 Facet Labs, Llc Interactive personal assistive devices and systems with artificial intelligence, and related methods
US20190354599A1 (en) 2018-05-21 2019-11-21 Microsoft Technology Licensing, Llc Ai model canvas
CN110728363B (en) 2018-06-29 2022-11-18 华为技术有限公司 Task processing method and device
US10769495B2 (en) 2018-08-01 2020-09-08 Adobe Inc. Collecting multimodal image editing requests
US10402589B1 (en) 2018-12-20 2019-09-03 Vijay K. Madisetti Method and system for securing cloud storage and databases from insider threats and optimizing performance
US20200242146A1 (en) 2019-01-24 2020-07-30 Andrew R. Kalukin Artificial intelligence system for generating conjectures and comprehending text, audio, and visual data using natural language understanding
US11544594B2 (en) 2019-04-11 2023-01-03 Sunghee Woo Electronic device comprising user interface for providing user-participating-type AI training service, and server and method for providing user-participating-type AI training service using the electronic device
US11715467B2 (en) 2019-04-17 2023-08-01 Tempus Labs, Inc. Collaborative artificial intelligence method and system
US11328717B2 (en) 2019-04-18 2022-05-10 Lg Electronics Inc. Electronic device, operating method thereof, system having plural artificial intelligence devices
US20200342968A1 (en) 2019-04-24 2020-10-29 GE Precision Healthcare LLC Visualization of medical device event processing
WO2020246634A1 (en) 2019-06-04 2020-12-10 엘지전자 주식회사 Artificial intelligence device capable of controlling operation of other devices, and operation method thereof
KR20190080834A (en) 2019-06-18 2019-07-08 엘지전자 주식회사 Dialect phoneme adaptive training system and method
US11501753B2 (en) 2019-06-26 2022-11-15 Samsung Electronics Co., Ltd. System and method for automating natural language understanding (NLU) in skill development
US20210011887A1 (en) 2019-07-12 2021-01-14 Qualcomm Incorporated Activity query response system
KR20190095181A (en) 2019-07-25 2019-08-14 엘지전자 주식회사 Video conference system using artificial intelligence
KR20190099167A (en) 2019-08-06 2019-08-26 엘지전자 주식회사 An artificial intelligence apparatus for performing speech recognition and method for the same
US11222464B2 (en) 2019-08-22 2022-01-11 The Travelers Indemnity Company Intelligent imagery
US10827028B1 (en) 2019-09-05 2020-11-03 Spotify Ab Systems and methods for playing media content on a target device
US11636102B2 (en) 2019-09-05 2023-04-25 Verizon Patent And Licensing Inc. Natural language-based content system with corrective feedback and training
KR20210066328A (en) 2019-11-28 2021-06-07 엘지전자 주식회사 An artificial intelligence apparatus for learning natural language understanding models
US11042369B1 (en) 2020-02-03 2021-06-22 Architecture Technology Corporation Systems and methods for modernizing and optimizing legacy source code
US11995561B2 (en) 2020-03-17 2024-05-28 MeetKai, Inc. Universal client API for AI services
US11991253B2 (en) 2020-03-17 2024-05-21 MeetKai, Inc. Intelligent layer to power cross platform, edge-cloud hybrid artificial intelligence services
US11521597B2 (en) * 2020-09-03 2022-12-06 Google Llc Correcting speech misrecognition of spoken utterances
US11984124B2 (en) * 2020-11-13 2024-05-14 Apple Inc. Speculative task flow execution
US11676593B2 (en) * 2020-12-01 2023-06-13 International Business Machines Corporation Training an artificial intelligence of a voice response system based on non_verbal feedback
