JP7840941B2 - コンテキストベースのモデル選択 - Google Patents
コンテキストベースのモデル選択Info
- Publication number
- JP7840941B2 JP7840941B2 JP2023528472A JP2023528472A JP7840941B2 JP 7840941 B2 JP7840941 B2 JP 7840941B2 JP 2023528472 A JP2023528472 A JP 2023528472A JP 2023528472 A JP2023528472 A JP 2023528472A JP 7840941 B2 JP7840941 B2 JP 7840941B2
- Authority
- JP
- Japan
- Prior art keywords
- model
- context
- data
- sec
- processors
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/34—Network arrangements or protocols for supporting network services or applications involving the movement of software or configuration parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/02—Services making use of location information
- H04W4/025—Services making use of location information using location based information parameters
- H04W4/027—Services making use of location information using location based information parameters using movement velocity, acceleration information
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Computer Networks & Wireless Communication (AREA)
- General Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
- Electrotherapy Devices (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/102,748 US12198057B2 (en) | 2020-11-24 | 2020-11-24 | Context-based model selection |
| US17/102,748 | 2020-11-24 | ||
| PCT/US2021/072521 WO2022115839A1 (en) | 2020-11-24 | 2021-11-19 | Context-based model selection |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2023550336A JP2023550336A (ja) | 2023-12-01 |
| JP2023550336A5 JP2023550336A5 (https=) | 2024-10-29 |
| JP7840941B2 true JP7840941B2 (ja) | 2026-04-06 |
Family
ID=79024726
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023528472A Active JP7840941B2 (ja) | 2020-11-24 | 2021-11-19 | コンテキストベースのモデル選択 |
Country Status (7)
| Country | Link |
|---|---|
| US (2) | US12198057B2 (https=) |
| EP (1) | EP4252230A1 (https=) |
| JP (1) | JP7840941B2 (https=) |
| KR (1) | KR20230110518A (https=) |
| CN (1) | CN116601703A (https=) |
| TW (1) | TW202232362A (https=) |
| WO (1) | WO2022115839A1 (https=) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20220111078A (ko) * | 2021-02-01 | 2022-08-09 | 삼성전자주식회사 | 전자 장치, 사운드 입출력 기기를 포함하는 시스템 및 그 제어 방법 |
| US20220391685A1 (en) * | 2021-06-02 | 2022-12-08 | Arm Limited | System, devices and/or processes for augmenting artificial intelligence agent and computing devices |
| JP7548147B2 (ja) * | 2021-07-19 | 2024-09-10 | トヨタ自動車株式会社 | 配送車両 |
| JP7666430B2 (ja) * | 2022-07-15 | 2025-04-22 | トヨタ自動車株式会社 | 車両用情報処理装置、車両用情報処理システム及び車両用情報処理方法 |
| US12293773B2 (en) * | 2022-11-03 | 2025-05-06 | Robert Bosch Gmbh | Automatically selecting a sound recognition model for an environment based on audio data and image data associated with the environment |
| WO2024259213A1 (en) * | 2023-06-15 | 2024-12-19 | Convida Wireless, Llc | Inferencing model selection and management |
| WO2025023469A1 (ko) * | 2023-07-26 | 2025-01-30 | 삼성전자주식회사 | 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법 |
| TWI887855B (zh) * | 2023-11-18 | 2025-06-21 | 鴻海精密工業股份有限公司 | 車輛、車輛降噪方法及系統 |
| DE102023132751A1 (de) * | 2023-11-23 | 2025-05-28 | Audi Aktiengesellschaft | Verfahren zum Betreiben eines Sprachdialogsystems sowie Sprachdialogsystem |
| US12542707B2 (en) * | 2024-02-22 | 2026-02-03 | Dell Products L.P. | Facilitating intelligent concept drift mitigation in advanced communication networks |
| TWI902207B (zh) * | 2024-04-08 | 2025-10-21 | 律芯科技股份有限公司 | 基於虛擬麥克風的車用降噪系統及車用降噪系統訓練方法 |
| US12608422B2 (en) * | 2024-07-24 | 2026-04-21 | Robert Bosch Gmbh | Video management system and method for audio event search and classification |
| WO2026053065A1 (en) * | 2024-09-03 | 2026-03-12 | Cochlear Limited | Linguistic context in hearing device systems |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20140303972A1 (en) | 2007-05-29 | 2014-10-09 | At&T Intellectual Property Ii, L.P. | Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition |
| WO2017217412A1 (ja) | 2016-06-16 | 2017-12-21 | 日本電気株式会社 | 信号処理装置、信号処理方法およびコンピュータ読み取り可能記録媒体 |
| US20180330737A1 (en) | 2017-05-12 | 2018-11-15 | Apple Inc. | User-specific acoustic models |
| US20180336000A1 (en) | 2017-05-19 | 2018-11-22 | Intel Corporation | Contextual sound filter |
| US20190206418A1 (en) | 2016-09-09 | 2019-07-04 | Huawei Technologies Co., Ltd. | Device and a method for classifying an acoustic environment |
Family Cites Families (47)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4827521A (en) | 1986-03-27 | 1989-05-02 | International Business Machines Corporation | Training of markov models used in a speech recognition system |
| CN100472500C (zh) | 1998-10-02 | 2009-03-25 | 联想(新加坡)私人有限公司 | 会话浏览器和会话系统 |
| EP1282113B1 (en) | 2001-08-02 | 2005-01-12 | Sony International (Europe) GmbH | Method for detecting emotions from speech using speaker identification |
| US7620547B2 (en) | 2002-07-25 | 2009-11-17 | Sony Deutschland Gmbh | Spoken man-machine interface with speaker identification |
| US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
| US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
| US20070183604A1 (en) | 2006-02-09 | 2007-08-09 | St-Infonox | Response to anomalous acoustic environments |
| US7877335B2 (en) | 2007-10-18 | 2011-01-25 | Yahoo! Inc. | System and method for learning a network of categories using prediction |
| US8788270B2 (en) | 2009-06-16 | 2014-07-22 | University Of Florida Research Foundation, Inc. | Apparatus and method for determining an emotion state of a speaker |
| US8600743B2 (en) | 2010-01-06 | 2013-12-03 | Apple Inc. | Noise profile determination for voice-related feature |
| US8381107B2 (en) | 2010-01-13 | 2013-02-19 | Apple Inc. | Adaptive audio feedback system and method |
| US9165556B1 (en) | 2012-02-01 | 2015-10-20 | Predictive Business Intelligence, LLC | Methods and systems related to audio data processing to provide key phrase notification and potential cost associated with the key phrase |
| US9575963B2 (en) | 2012-04-20 | 2017-02-21 | Maluuba Inc. | Conversational agent |
| US8463648B1 (en) | 2012-05-04 | 2013-06-11 | Pearl.com LLC | Method and apparatus for automated topic extraction used for the creation and promotion of new categories in a consultation system |
| US20140074466A1 (en) | 2012-09-10 | 2014-03-13 | Google Inc. | Answering questions using environmental context |
| US10134401B2 (en) | 2012-11-21 | 2018-11-20 | Verint Systems Ltd. | Diarization using linguistic labeling |
| US9449613B2 (en) | 2012-12-06 | 2016-09-20 | Audeme Llc | Room identification using acoustic features in a recording |
| US10013483B2 (en) | 2014-01-30 | 2018-07-03 | Microsoft Technology Licensing, Llc | System and method for identifying trending topics in a social network |
| WO2015120184A1 (en) | 2014-02-06 | 2015-08-13 | Otosense Inc. | Instant real time neuro-compatible imaging of signals |
| WO2015196063A1 (en) | 2014-06-19 | 2015-12-23 | Robert Bosch Gmbh | System and method for speech-enabled personalized operation of devices and services in multiple operating environments |
| US10073673B2 (en) | 2014-07-14 | 2018-09-11 | Samsung Electronics Co., Ltd. | Method and system for robust tagging of named entities in the presence of source or translation errors |
| US9412361B1 (en) | 2014-09-30 | 2016-08-09 | Amazon Technologies, Inc. | Configuring system operation using image data |
| US9643511B2 (en) | 2014-12-17 | 2017-05-09 | Samsung Electronics Co., Ltd. | Method and apparatus for estimating state of charge (SOC) of battery in electric vehicle |
| JP5956624B1 (ja) | 2015-02-02 | 2016-07-27 | 西日本高速道路エンジニアリング四国株式会社 | 異常音の検出方法及びその検出値を用いた構造物の異常判定方法、並びに、振動波の類似度検出方法及びその検出値を用いた音声認識方法 |
| WO2016135069A1 (en) * | 2015-02-26 | 2016-09-01 | Koninklijke Philips N.V. | Context detection for medical monitoring |
| US10482184B2 (en) | 2015-03-08 | 2019-11-19 | Google Llc | Context-based natural language processing |
| JP6556575B2 (ja) | 2015-09-15 | 2019-08-07 | 株式会社東芝 | 音声処理装置、音声処理方法及び音声処理プログラム |
| US9847000B2 (en) | 2015-10-29 | 2017-12-19 | Immersion Corporation | Ambient triggered notifications for rendering haptic effects |
| US9946862B2 (en) | 2015-12-01 | 2018-04-17 | Qualcomm Incorporated | Electronic device generating notification based on context data in response to speech phrase from user |
| US10026401B1 (en) | 2015-12-28 | 2018-07-17 | Amazon Technologies, Inc. | Naming devices via voice commands |
| US10902043B2 (en) * | 2016-01-03 | 2021-01-26 | Gracenote, Inc. | Responding to remote media classification queries using classifier models and context parameters |
| US10373612B2 (en) | 2016-03-21 | 2019-08-06 | Amazon Technologies, Inc. | Anchored speech detection and speech recognition |
| US10304444B2 (en) | 2016-03-23 | 2019-05-28 | Amazon Technologies, Inc. | Fine-grained natural language understanding |
| WO2017187712A1 (ja) | 2016-04-26 | 2017-11-02 | 株式会社ソニー・インタラクティブエンタテインメント | 情報処理装置 |
| US10026405B2 (en) | 2016-05-03 | 2018-07-17 | SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S. | Method for speaker diarization |
| US10705683B2 (en) | 2016-10-31 | 2020-07-07 | Microsoft Technology Licensing, Llc | Changing visual aspects of a graphical user interface to bring focus to a message |
| EP3545374B1 (en) | 2016-11-23 | 2024-11-06 | Alarm.com Incorporated | Detection of authorized user presence and handling of unauthenticated monitoring system commands |
| US10713703B2 (en) | 2016-11-30 | 2020-07-14 | Apple Inc. | Diversity in media item recommendations |
| EP3905242A1 (en) * | 2017-05-12 | 2021-11-03 | Apple Inc. | User-specific acoustic models |
| US10311454B2 (en) | 2017-06-22 | 2019-06-04 | NewVoiceMedia Ltd. | Customer interaction and experience system using emotional-semantic computing |
| WO2019070328A1 (en) * | 2017-10-04 | 2019-04-11 | Google Llc | METHODS AND SYSTEMS FOR AUTOMATICALLY EQUALIZING AUDIO OUTPUT BASED ON THE CHARACTERISTICS OF THE PART |
| US10360482B1 (en) * | 2017-12-04 | 2019-07-23 | Amazon Technologies, Inc. | Crowd-sourced artificial intelligence image processing services |
| US10481858B2 (en) | 2017-12-06 | 2019-11-19 | Harman International Industries, Incorporated | Generating personalized audio content based on mood |
| US10832009B2 (en) | 2018-01-02 | 2020-11-10 | International Business Machines Corporation | Extraction and summarization of decision elements from communications |
| CN110276235B (zh) | 2018-01-25 | 2023-06-16 | 意法半导体公司 | 通过感测瞬态事件和连续事件的智能装置的情境感知 |
| US11011162B2 (en) * | 2018-06-01 | 2021-05-18 | Soundhound, Inc. | Custom acoustic models |
| US11195534B1 (en) * | 2020-03-30 | 2021-12-07 | Amazon Technologies, Inc. | Permissioning for natural language processing systems |
-
2020
- 2020-11-24 US US17/102,748 patent/US12198057B2/en active Active
-
2021
- 2021-11-19 KR KR1020237016581A patent/KR20230110518A/ko active Pending
- 2021-11-19 JP JP2023528472A patent/JP7840941B2/ja active Active
- 2021-11-19 EP EP21831198.3A patent/EP4252230A1/en active Pending
- 2021-11-19 CN CN202180077450.9A patent/CN116601703A/zh active Pending
- 2021-11-19 WO PCT/US2021/072521 patent/WO2022115839A1/en not_active Ceased
- 2021-11-22 TW TW110143343A patent/TW202232362A/zh unknown
-
2024
- 2024-12-05 US US18/970,538 patent/US20250103888A1/en active Pending
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20140303972A1 (en) | 2007-05-29 | 2014-10-09 | At&T Intellectual Property Ii, L.P. | Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition |
| WO2017217412A1 (ja) | 2016-06-16 | 2017-12-21 | 日本電気株式会社 | 信号処理装置、信号処理方法およびコンピュータ読み取り可能記録媒体 |
| US20190206418A1 (en) | 2016-09-09 | 2019-07-04 | Huawei Technologies Co., Ltd. | Device and a method for classifying an acoustic environment |
| US20180330737A1 (en) | 2017-05-12 | 2018-11-15 | Apple Inc. | User-specific acoustic models |
| US20180336000A1 (en) | 2017-05-19 | 2018-11-22 | Intel Corporation | Contextual sound filter |
Also Published As
| Publication number | Publication date |
|---|---|
| KR20230110518A (ko) | 2023-07-24 |
| WO2022115839A1 (en) | 2022-06-02 |
| US20250103888A1 (en) | 2025-03-27 |
| JP2023550336A (ja) | 2023-12-01 |
| US20220164662A1 (en) | 2022-05-26 |
| CN116601703A (zh) | 2023-08-15 |
| EP4252230A1 (en) | 2023-10-04 |
| TW202232362A (zh) | 2022-08-16 |
| US12198057B2 (en) | 2025-01-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7840941B2 (ja) | コンテキストベースのモデル選択 | |
| JP7757405B2 (ja) | 適応型サウンドイベント分類 | |
| US10869154B2 (en) | Location-based personal audio | |
| CN113377899A (zh) | 意图识别方法及电子设备 | |
| EP4066243B1 (en) | Sound event detection learning | |
| JP6619488B2 (ja) | 人工知能機器における連続会話機能 | |
| KR20230110512A (ko) | 사운드 이벤트 분류를 위한 전이 학습 | |
| US11822889B2 (en) | Personal conversationalist system | |
| US12114075B1 (en) | Object selection in computer vision | |
| CN105573128B (zh) | 用户装置及其驱动方法、提供服务的设备及其驱动方法 | |
| JP7501523B2 (ja) | 情報処理装置、情報処理方法、およびプログラム | |
| US20190163436A1 (en) | Electronic device and method for controlling the same | |
| CN117897676A (zh) | 与iot设备构建增强现实体验 | |
| US11997445B2 (en) | Systems and methods for live conversation using hearing devices | |
| CN117916692A (zh) | 使用ar摄像头对iot设备的双向控制 | |
| US12596961B1 (en) | Local device embeddings for automation | |
| KR20250138610A (ko) | 맥락에 기반하여 사용자 입력에 대한 응답을 생성하는 방법, 장치 및 기록 매체 | |
| KR20240049831A (ko) | IoT 디바이스들과 상호 작용하는 카메라 인터페이스들 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20241021 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20241021 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20250926 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20251007 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20251219 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20260303 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20260325 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 7840941 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |