JP2022539794A5 - - Google Patents

Info

Publication number
JP2022539794A5
JP2022539794A5 JP2022500128A JP2022500128A JP2022539794A5 JP 2022539794 A5 JP2022539794 A5 JP 2022539794A5 JP 2022500128 A JP2022500128 A JP 2022500128A JP 2022500128 A JP2022500128 A JP 2022500128A JP 2022539794 A5 JP2022539794 A5 JP 2022539794A5
Authority
JP
Japan
Prior art keywords
input
mode
data
user
feedback message
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022500128A
Other languages
English (en)
Japanese (ja)
Other versions
JP7522177B2 (ja
JP2022539794A (ja
Filing date
Publication date
Priority claimed from US16/685,946 external-priority patent/US11348581B2/en
Application filed filed Critical
Publication of JP2022539794A publication Critical patent/JP2022539794A/ja
Publication of JP2022539794A5 publication Critical patent/JP2022539794A5/ja
Application granted granted Critical
Publication of JP7522177B2 publication Critical patent/JP7522177B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2022500128A 2019-07-12 2020-07-10 マルチモーダルユーザインターフェース Active JP7522177B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962873775P 2019-07-12 2019-07-12
US62/873,775 2019-07-12
US16/685,946 2019-11-15
US16/685,946 US11348581B2 (en) 2019-07-12 2019-11-15 Multi-modal user interface
PCT/US2020/041499 WO2021011331A1 (en) 2019-07-12 2020-07-10 Multi-modal user interface

Publications (3)

Publication Number Publication Date
JP2022539794A JP2022539794A (ja) 2022-09-13
JP2022539794A5 true JP2022539794A5 (https=) 2023-06-20
JP7522177B2 JP7522177B2 (ja) 2024-07-24

Family

ID=74101815

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022500128A Active JP7522177B2 (ja) 2019-07-12 2020-07-10 マルチモーダルユーザインターフェース

Country Status (9)

Country Link
US (1) US11348581B2 (https=)
EP (1) EP3997553A1 (https=)
JP (1) JP7522177B2 (https=)
KR (1) KR20220031610A (https=)
CN (1) CN114127665B (https=)
BR (1) BR112021026765A2 (https=)
PH (1) PH12021553219A1 (https=)
TW (1) TWI840587B (https=)
WO (1) WO2021011331A1 (https=)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021103191A (ja) * 2018-03-30 2021-07-15 ソニーグループ株式会社 情報処理装置および情報処理方法
US11615801B1 (en) * 2019-09-20 2023-03-28 Apple Inc. System and method of enhancing intelligibility of audio playback
US11521643B2 (en) * 2020-05-08 2022-12-06 Bose Corporation Wearable audio device with user own-voice recording
WO2022016406A1 (zh) * 2020-07-22 2022-01-27 北京小米移动软件有限公司 信息传输方法、装置及通信设备
US11996095B2 (en) 2020-08-12 2024-05-28 Kyndryl, Inc. Augmented reality enabled command management
US11878244B2 (en) * 2020-09-10 2024-01-23 Holland Bloorview Kids Rehabilitation Hospital Customizable user input recognition systems
US11830486B2 (en) * 2020-10-13 2023-11-28 Google Llc Detecting near matches to a hotword or phrase
US11461681B2 (en) * 2020-10-14 2022-10-04 Openstream Inc. System and method for multi-modality soft-agent for query population and information mining
US11809480B1 (en) * 2020-12-31 2023-11-07 Meta Platforms, Inc. Generating dynamic knowledge graph of media contents for assistant systems
US12321865B2 (en) * 2021-01-25 2025-06-03 Salesforce, Inc. Event prediction based on multimodal learning
US11651541B2 (en) * 2021-03-01 2023-05-16 Roblox Corporation Integrated input/output (I/O) for a three-dimensional (3D) environment
CN113282172A (zh) * 2021-05-18 2021-08-20 前海七剑科技(深圳)有限公司 一种手势识别的控制方法和装置
US11783073B2 (en) * 2021-06-21 2023-10-10 Microsoft Technology Licensing, Llc Configuration of default sensitivity labels for network file storage locations
WO2023272629A1 (zh) * 2021-06-30 2023-01-05 华为技术有限公司 界面的控制方法、装置和系统
US12614095B2 (en) * 2021-07-12 2026-04-28 Cypress Semiconductor Corporation System and method for activity classification
WO2023035073A1 (en) * 2021-09-08 2023-03-16 Huawei Technologies Canada Co., Ltd. Methods and devices for communication with multimodal compositions
US11966663B1 (en) * 2021-09-29 2024-04-23 Amazon Technologies, Inc. Speech processing and multi-modal widgets
US20230104856A1 (en) * 2021-10-05 2023-04-06 Rfmicron, Inc. Data logging device
US11971710B2 (en) * 2021-11-12 2024-04-30 Pani Energy Inc Digital model based plant operation and optimization
US12333794B2 (en) * 2021-11-12 2025-06-17 Sony Group Corporation Emotion recognition in multimedia videos using multi-modal fusion-based deep neural network
WO2024029827A1 (ko) * 2022-08-01 2024-02-08 삼성전자 주식회사 제어 추천을 위한 전자 장치 및 컴퓨터 판독가능 저장 매체
US20240036527A1 (en) * 2022-08-01 2024-02-01 Samsung Electronics Co., Ltd. Electronic device and computer readable storage medium for control recommendation
KR20240079507A (ko) * 2022-11-29 2024-06-05 한국전자통신연구원 크로스모달 정보를 이용한 언어모델 생성 방법 및 장치
EP4524685A1 (en) * 2023-09-12 2025-03-19 Rohde & Schwarz GmbH & Co. KG Measurement application device, and method
US20250178624A1 (en) * 2023-12-01 2025-06-05 Qualcomm Incorporated Speech-based vehicular control
US20260016309A1 (en) * 2024-07-11 2026-01-15 Apple Inc. Providing movement dynamics estimations

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8386255B2 (en) * 2009-03-17 2013-02-26 Avaya Inc. Providing descriptions of visually presented information to video teleconference participants who are not video-enabled
US9123341B2 (en) 2009-03-18 2015-09-01 Robert Bosch Gmbh System and method for multi-modal input synchronization and disambiguation
KR101092820B1 (ko) 2009-09-22 2011-12-12 현대자동차주식회사 립리딩과 음성 인식 통합 멀티모달 인터페이스 시스템
US8473289B2 (en) * 2010-08-06 2013-06-25 Google Inc. Disambiguating input based on context
US20130031076A1 (en) * 2011-07-28 2013-01-31 Kikin, Inc. Systems and methods for contextual searching of semantic entities
US20130085753A1 (en) * 2011-09-30 2013-04-04 Google Inc. Hybrid Client/Server Speech Recognition In A Mobile Device
US9152376B2 (en) * 2011-12-01 2015-10-06 At&T Intellectual Property I, L.P. System and method for continuous multimodal speech and gesture interaction
US9465833B2 (en) * 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
CN103729386B (zh) * 2012-10-16 2017-08-04 阿里巴巴集团控股有限公司 信息查询系统与方法
WO2014070872A2 (en) 2012-10-30 2014-05-08 Robert Bosch Gmbh System and method for multimodal interaction with reduced distraction in operating vehicles
US9190058B2 (en) * 2013-01-25 2015-11-17 Microsoft Technology Licensing, Llc Using visual cues to disambiguate speech inputs
WO2014182787A2 (en) 2013-05-08 2014-11-13 Jpmorgan Chase Bank, N.A. Systems and methods for high fidelity multi-modal out-of-band biometric authentication
US10402060B2 (en) 2013-06-28 2019-09-03 Orange System and method for gesture disambiguation
US10741182B2 (en) * 2014-02-18 2020-08-11 Lenovo (Singapore) Pte. Ltd. Voice input correction using non-audio based input
US8825585B1 (en) 2014-03-11 2014-09-02 Fmr Llc Interpretation of natural communication
US20160034249A1 (en) * 2014-07-31 2016-02-04 Microsoft Technology Licensing Llc Speechless interaction with a speech recognition device
US10446141B2 (en) * 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
CN105843605B (zh) * 2016-03-17 2019-03-08 中国银行股份有限公司 一种数据映射方法及装置
JP2018036902A (ja) * 2016-08-31 2018-03-08 島根県 機器操作システム、機器操作方法および機器操作プログラム
DK201770411A1 (en) * 2017-05-15 2018-12-20 Apple Inc. Multi-modal interfaces
US20180357040A1 (en) * 2017-06-09 2018-12-13 Mitsubishi Electric Automotive America, Inc. In-vehicle infotainment with multi-modal interface
US11430437B2 (en) * 2017-08-01 2022-08-30 Sony Corporation Information processor and information processing method

Similar Documents

Publication Publication Date Title
JP2022539794A5 (https=)
JP7522177B2 (ja) マルチモーダルユーザインターフェース
JP2023550092A5 (https=)
JP2023550336A5 (https=)
JP7757405B2 (ja) 適応型サウンドイベント分類
JP2022542295A5 (https=)
US20170345425A1 (en) Voice dialog device and voice dialog method
JP2022543201A5 (https=)
CN105453174A (zh) 话音增强方法及其装置
KR102193029B1 (ko) 디스플레이 장치 및 그의 화상 통화 수행 방법
CN105389097A (zh) 一种人机交互装置及方法
CN112154412B (zh) 用数字助理提供音频信息
KR20120072243A (ko) 음향/음성 인식을 위한 잡음 제거 장치 및 그 방법
JP7753363B2 (ja) ユーザ発話プロファイル管理
KR102779400B1 (ko) 탠덤 네트워크들을 사용한 잡음 억제
WO2019107145A1 (ja) 情報処理装置、及び情報処理方法
JPWO2016151956A1 (ja) 情報処理システムおよび情報処理方法
WO2017141530A1 (ja) 情報処理装置、情報処理方法、及びプログラム
EP4588038A1 (en) Source speech modification based on an input speech characteristic
KR20200048701A (ko) 사용자 특화 음성 명령어를 공유하기 위한 전자 장치 및 그 제어 방법
CN110097872A (zh) 一种音频处理方法及电子设备
CN110194181A (zh) 驾驶支持方法、车辆和驾驶支持系统
JP2023545981A5 (https=)
JP7703795B2 (ja) 音源表現を使用したオーディオ処理
KR20250069531A (ko) 비디오 스트림으로의 키워드-기반 객체 삽입