JPWO2021011331A5 - - Google Patents

Download PDF

Info

Publication number
JPWO2021011331A5
JPWO2021011331A5 JP2022500128A JP2022500128A JPWO2021011331A5 JP WO2021011331 A5 JPWO2021011331 A5 JP WO2021011331A5 JP 2022500128 A JP2022500128 A JP 2022500128A JP 2022500128 A JP2022500128 A JP 2022500128A JP WO2021011331 A5 JPWO2021011331 A5 JP WO2021011331A5
Authority
JP
Japan
Prior art keywords
input
mode
data
user
processors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022500128A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022539794A5 (https=
JP7522177B2 (ja
JP2022539794A (ja
Publication date
Priority claimed from US16/685,946 external-priority patent/US11348581B2/en
Application filed filed Critical
Publication of JP2022539794A publication Critical patent/JP2022539794A/ja
Publication of JP2022539794A5 publication Critical patent/JP2022539794A5/ja
Publication of JPWO2021011331A5 publication Critical patent/JPWO2021011331A5/ja
Application granted granted Critical
Publication of JP7522177B2 publication Critical patent/JP7522177B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2022500128A 2019-07-12 2020-07-10 マルチモーダルユーザインターフェース Active JP7522177B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962873775P 2019-07-12 2019-07-12
US62/873,775 2019-07-12
US16/685,946 US11348581B2 (en) 2019-07-12 2019-11-15 Multi-modal user interface
US16/685,946 2019-11-15
PCT/US2020/041499 WO2021011331A1 (en) 2019-07-12 2020-07-10 Multi-modal user interface

Publications (4)

Publication Number Publication Date
JP2022539794A JP2022539794A (ja) 2022-09-13
JP2022539794A5 JP2022539794A5 (https=) 2023-06-20
JPWO2021011331A5 true JPWO2021011331A5 (https=) 2023-06-20
JP7522177B2 JP7522177B2 (ja) 2024-07-24

Family

ID=74101815

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022500128A Active JP7522177B2 (ja) 2019-07-12 2020-07-10 マルチモーダルユーザインターフェース

Country Status (9)

Country Link
US (1) US11348581B2 (https=)
EP (1) EP3997553A1 (https=)
JP (1) JP7522177B2 (https=)
KR (1) KR20220031610A (https=)
CN (1) CN114127665B (https=)
BR (1) BR112021026765A2 (https=)
PH (1) PH12021553219A1 (https=)
TW (1) TWI840587B (https=)
WO (1) WO2021011331A1 (https=)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021103191A (ja) * 2018-03-30 2021-07-15 ソニーグループ株式会社 情報処理装置および情報処理方法
US11615801B1 (en) * 2019-09-20 2023-03-28 Apple Inc. System and method of enhancing intelligibility of audio playback
US11521643B2 (en) * 2020-05-08 2022-12-06 Bose Corporation Wearable audio device with user own-voice recording
EP4187930A4 (en) * 2020-07-22 2024-04-03 Beijing Xiaomi Mobile Software Co., Ltd. INFORMATION TRANSMISSION METHOD AND APPARATUS, AND COMMUNICATION DEVICE
US11996095B2 (en) * 2020-08-12 2024-05-28 Kyndryl, Inc. Augmented reality enabled command management
US11878244B2 (en) * 2020-09-10 2024-01-23 Holland Bloorview Kids Rehabilitation Hospital Customizable user input recognition systems
US11830486B2 (en) * 2020-10-13 2023-11-28 Google Llc Detecting near matches to a hotword or phrase
US11461681B2 (en) * 2020-10-14 2022-10-04 Openstream Inc. System and method for multi-modality soft-agent for query population and information mining
US11809480B1 (en) * 2020-12-31 2023-11-07 Meta Platforms, Inc. Generating dynamic knowledge graph of media contents for assistant systems
US12321865B2 (en) * 2021-01-25 2025-06-03 Salesforce, Inc. Event prediction based on multimodal learning
US11651541B2 (en) * 2021-03-01 2023-05-16 Roblox Corporation Integrated input/output (I/O) for a three-dimensional (3D) environment
CN113282172A (zh) * 2021-05-18 2021-08-20 前海七剑科技(深圳)有限公司 一种手势识别的控制方法和装置
US11783073B2 (en) * 2021-06-21 2023-10-10 Microsoft Technology Licensing, Llc Configuration of default sensitivity labels for network file storage locations
CN116670624A (zh) * 2021-06-30 2023-08-29 华为技术有限公司 界面的控制方法、装置和系统
CN118251878A (zh) * 2021-09-08 2024-06-25 华为技术加拿大有限公司 使用多模态合成进行通信的方法和设备
US11966663B1 (en) * 2021-09-29 2024-04-23 Amazon Technologies, Inc. Speech processing and multi-modal widgets
US20230104856A1 (en) * 2021-10-05 2023-04-06 Rfmicron, Inc. Data logging device
US12333794B2 (en) * 2021-11-12 2025-06-17 Sony Group Corporation Emotion recognition in multimedia videos using multi-modal fusion-based deep neural network
US11971710B2 (en) * 2021-11-12 2024-04-30 Pani Energy Inc Digital model based plant operation and optimization
US20240036527A1 (en) * 2022-08-01 2024-02-01 Samsung Electronics Co., Ltd. Electronic device and computer readable storage medium for control recommendation
WO2024029827A1 (ko) * 2022-08-01 2024-02-08 삼성전자 주식회사 제어 추천을 위한 전자 장치 및 컴퓨터 판독가능 저장 매체
KR20240079507A (ko) * 2022-11-29 2024-06-05 한국전자통신연구원 크로스모달 정보를 이용한 언어모델 생성 방법 및 장치
EP4524685A1 (en) * 2023-09-12 2025-03-19 Rohde & Schwarz GmbH & Co. KG Measurement application device, and method
US20250178624A1 (en) * 2023-12-01 2025-06-05 Qualcomm Incorporated Speech-based vehicular control
US20260016309A1 (en) * 2024-07-11 2026-01-15 Apple Inc. Providing movement dynamics estimations

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8386255B2 (en) * 2009-03-17 2013-02-26 Avaya Inc. Providing descriptions of visually presented information to video teleconference participants who are not video-enabled
US9123341B2 (en) 2009-03-18 2015-09-01 Robert Bosch Gmbh System and method for multi-modal input synchronization and disambiguation
KR101092820B1 (ko) 2009-09-22 2011-12-12 현대자동차주식회사 립리딩과 음성 인식 통합 멀티모달 인터페이스 시스템
US8473289B2 (en) * 2010-08-06 2013-06-25 Google Inc. Disambiguating input based on context
US8898583B2 (en) * 2011-07-28 2014-11-25 Kikin Inc. Systems and methods for providing information regarding semantic entities included in a page of content
US20130085753A1 (en) * 2011-09-30 2013-04-04 Google Inc. Hybrid Client/Server Speech Recognition In A Mobile Device
US9152376B2 (en) * 2011-12-01 2015-10-06 At&T Intellectual Property I, L.P. System and method for continuous multimodal speech and gesture interaction
US9465833B2 (en) * 2012-07-31 2016-10-11 Veveo, Inc. Disambiguating user intent in conversational interaction system for large corpus information retrieval
CN103729386B (zh) * 2012-10-16 2017-08-04 阿里巴巴集团控股有限公司 信息查询系统与方法
WO2014070872A2 (en) 2012-10-30 2014-05-08 Robert Bosch Gmbh System and method for multimodal interaction with reduced distraction in operating vehicles
US9190058B2 (en) * 2013-01-25 2015-11-17 Microsoft Technology Licensing, Llc Using visual cues to disambiguate speech inputs
EP2995040B1 (en) 2013-05-08 2022-11-16 JPMorgan Chase Bank, N.A. Systems and methods for high fidelity multi-modal out-of-band biometric authentication
US10402060B2 (en) 2013-06-28 2019-09-03 Orange System and method for gesture disambiguation
US10741182B2 (en) * 2014-02-18 2020-08-11 Lenovo (Singapore) Pte. Ltd. Voice input correction using non-audio based input
US8825585B1 (en) 2014-03-11 2014-09-02 Fmr Llc Interpretation of natural communication
US20160034249A1 (en) * 2014-07-31 2016-02-04 Microsoft Technology Licensing Llc Speechless interaction with a speech recognition device
US10446141B2 (en) * 2014-08-28 2019-10-15 Apple Inc. Automatic speech recognition based on user feedback
CN105843605B (zh) * 2016-03-17 2019-03-08 中国银行股份有限公司 一种数据映射方法及装置
JP2018036902A (ja) * 2016-08-31 2018-03-08 島根県 機器操作システム、機器操作方法および機器操作プログラム
DK201770411A1 (en) * 2017-05-15 2018-12-20 Apple Inc. Multi-modal interfaces
US20180357040A1 (en) * 2017-06-09 2018-12-13 Mitsubishi Electric Automotive America, Inc. In-vehicle infotainment with multi-modal interface
US11430437B2 (en) * 2017-08-01 2022-08-30 Sony Corporation Information processor and information processing method

Similar Documents

Publication Publication Date Title
JPWO2021011331A5 (https=)
CN110164420B (zh) 一种语音识别的方法、语音断句的方法及装置
EP2961195B1 (en) Do-not-disturb system and apparatus
KR102193029B1 (ko) 디스플레이 장치 및 그의 화상 통화 수행 방법
CN112154412B (zh) 用数字助理提供音频信息
CN105453174A (zh) 话音增强方法及其装置
JP2022188081A (ja) 情報処理装置、情報処理システム、および情報処理方法
CN108701455A (zh) 信息处理装置、信息处理方法和程序
CN110097872A (zh) 一种音频处理方法及电子设备
WO2019107145A1 (ja) 情報処理装置、及び情報処理方法
KR20200048701A (ko) 사용자 특화 음성 명령어를 공유하기 위한 전자 장치 및 그 제어 방법
KR20210017081A (ko) 객체에 대응하는 그래픽 요소 표시 방법 및 장치
KR20240017404A (ko) 탠덤 네트워크들을 사용한 잡음 억제
WO2024059427A1 (en) Source speech modification based on an input speech characteristic
JPWO2022115838A5 (https=)
JPWO2022115839A5 (https=)
CN108765522B (zh) 一种动态图像生成方法及移动终端
JP6977768B2 (ja) 情報処理装置、情報処理方法、音声出力装置、および音声出力方法
CN106293064A (zh) 一种信息处理方法及设备
JP7459391B2 (ja) オーディオソース指向性に基づく心理音響的強調
CN111343420A (zh) 一种语音增强方法及穿戴设备
WO2016206646A1 (zh) 使机器装置产生动作的方法及系统
JPWO2021021970A5 (https=)
KR102204488B1 (ko) 통신 장치
JP2018045192A (ja) 音声対話装置および発話音量調整方法