KR20230110518A - 콘텍스트-기반 모델 선택 - Google Patents
콘텍스트-기반 모델 선택 Download PDFInfo
- Publication number
- KR20230110518A KR20230110518A KR1020237016581A KR20237016581A KR20230110518A KR 20230110518 A KR20230110518 A KR 20230110518A KR 1020237016581 A KR1020237016581 A KR 1020237016581A KR 20237016581 A KR20237016581 A KR 20237016581A KR 20230110518 A KR20230110518 A KR 20230110518A
- Authority
- KR
- South Korea
- Prior art keywords
- model
- context
- data
- sec
- processors
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/34—Network arrangements or protocols for supporting network services or applications involving the movement of software or configuration parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W4/00—Services specially adapted for wireless communication networks; Facilities therefor
- H04W4/02—Services making use of location information
- H04W4/025—Services making use of location information using location based information parameters
- H04W4/027—Services making use of location information using location based information parameters using movement velocity, acceleration information
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Computer Networks & Wireless Communication (AREA)
- General Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Electrotherapy Devices (AREA)
- Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/102,748 | 2020-11-24 | ||
| US17/102,748 US12198057B2 (en) | 2020-11-24 | 2020-11-24 | Context-based model selection |
| PCT/US2021/072521 WO2022115839A1 (en) | 2020-11-24 | 2021-11-19 | Context-based model selection |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| KR20230110518A true KR20230110518A (ko) | 2023-07-24 |
Family
ID=79024726
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020237016581A Pending KR20230110518A (ko) | 2020-11-24 | 2021-11-19 | 콘텍스트-기반 모델 선택 |
Country Status (7)
| Country | Link |
|---|---|
| US (2) | US12198057B2 (https=) |
| EP (1) | EP4252230A1 (https=) |
| JP (1) | JP7840941B2 (https=) |
| KR (1) | KR20230110518A (https=) |
| CN (1) | CN116601703A (https=) |
| TW (1) | TW202232362A (https=) |
| WO (1) | WO2022115839A1 (https=) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025023469A1 (ko) * | 2023-07-26 | 2025-01-30 | 삼성전자주식회사 | 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법 |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20220111078A (ko) * | 2021-02-01 | 2022-08-09 | 삼성전자주식회사 | 전자 장치, 사운드 입출력 기기를 포함하는 시스템 및 그 제어 방법 |
| US20220391685A1 (en) * | 2021-06-02 | 2022-12-08 | Arm Limited | System, devices and/or processes for augmenting artificial intelligence agent and computing devices |
| JP7548147B2 (ja) * | 2021-07-19 | 2024-09-10 | トヨタ自動車株式会社 | 配送車両 |
| JP7666430B2 (ja) * | 2022-07-15 | 2025-04-22 | トヨタ自動車株式会社 | 車両用情報処理装置、車両用情報処理システム及び車両用情報処理方法 |
| US12293773B2 (en) * | 2022-11-03 | 2025-05-06 | Robert Bosch Gmbh | Automatically selecting a sound recognition model for an environment based on audio data and image data associated with the environment |
| EP4728443A1 (en) * | 2023-06-15 | 2026-04-22 | InterDigital Patent Holdings, Inc. | Inferencing model selection and management |
| TWI887855B (zh) * | 2023-11-18 | 2025-06-21 | 鴻海精密工業股份有限公司 | 車輛、車輛降噪方法及系統 |
| DE102023132751A1 (de) * | 2023-11-23 | 2025-05-28 | Audi Aktiengesellschaft | Verfahren zum Betreiben eines Sprachdialogsystems sowie Sprachdialogsystem |
| US12542707B2 (en) * | 2024-02-22 | 2026-02-03 | Dell Products L.P. | Facilitating intelligent concept drift mitigation in advanced communication networks |
| TWI902207B (zh) * | 2024-04-08 | 2025-10-21 | 律芯科技股份有限公司 | 基於虛擬麥克風的車用降噪系統及車用降噪系統訓練方法 |
| US12608422B2 (en) * | 2024-07-24 | 2026-04-21 | Robert Bosch Gmbh | Video management system and method for audio event search and classification |
| WO2026053065A1 (en) * | 2024-09-03 | 2026-03-12 | Cochlear Limited | Linguistic context in hearing device systems |
Family Cites Families (52)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4827521A (en) | 1986-03-27 | 1989-05-02 | International Business Machines Corporation | Training of markov models used in a speech recognition system |
| CA2345661A1 (en) | 1998-10-02 | 2000-04-13 | International Business Machines Corporation | Conversational browser and conversational systems |
| DE60108373T2 (de) | 2001-08-02 | 2005-12-22 | Sony International (Europe) Gmbh | Verfahren zur Detektion von Emotionen in Sprachsignalen unter Verwendung von Sprecheridentifikation |
| US7620547B2 (en) | 2002-07-25 | 2009-11-17 | Sony Deutschland Gmbh | Spoken man-machine interface with speaker identification |
| US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
| US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
| US20070183604A1 (en) | 2006-02-09 | 2007-08-09 | St-Infonox | Response to anomalous acoustic environments |
| US8762143B2 (en) * | 2007-05-29 | 2014-06-24 | At&T Intellectual Property Ii, L.P. | Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition |
| US7877335B2 (en) | 2007-10-18 | 2011-01-25 | Yahoo! Inc. | System and method for learning a network of categories using prediction |
| US8788270B2 (en) | 2009-06-16 | 2014-07-22 | University Of Florida Research Foundation, Inc. | Apparatus and method for determining an emotion state of a speaker |
| US8600743B2 (en) | 2010-01-06 | 2013-12-03 | Apple Inc. | Noise profile determination for voice-related feature |
| US8381107B2 (en) | 2010-01-13 | 2013-02-19 | Apple Inc. | Adaptive audio feedback system and method |
| US9165556B1 (en) | 2012-02-01 | 2015-10-20 | Predictive Business Intelligence, LLC | Methods and systems related to audio data processing to provide key phrase notification and potential cost associated with the key phrase |
| US9575963B2 (en) | 2012-04-20 | 2017-02-21 | Maluuba Inc. | Conversational agent |
| US8463648B1 (en) | 2012-05-04 | 2013-06-11 | Pearl.com LLC | Method and apparatus for automated topic extraction used for the creation and promotion of new categories in a consultation system |
| US20140074466A1 (en) | 2012-09-10 | 2014-03-13 | Google Inc. | Answering questions using environmental context |
| US10134401B2 (en) | 2012-11-21 | 2018-11-20 | Verint Systems Ltd. | Diarization using linguistic labeling |
| US9449613B2 (en) | 2012-12-06 | 2016-09-20 | Audeme Llc | Room identification using acoustic features in a recording |
| US10013483B2 (en) | 2014-01-30 | 2018-07-03 | Microsoft Technology Licensing, Llc | System and method for identifying trending topics in a social network |
| US9466316B2 (en) | 2014-02-06 | 2016-10-11 | Otosense Inc. | Device, method and system for instant real time neuro-compatible imaging of a signal |
| US10410630B2 (en) | 2014-06-19 | 2019-09-10 | Robert Bosch Gmbh | System and method for speech-enabled personalized operation of devices and services in multiple operating environments |
| US10073673B2 (en) | 2014-07-14 | 2018-09-11 | Samsung Electronics Co., Ltd. | Method and system for robust tagging of named entities in the presence of source or translation errors |
| US9412361B1 (en) | 2014-09-30 | 2016-08-09 | Amazon Technologies, Inc. | Configuring system operation using image data |
| US9643511B2 (en) | 2014-12-17 | 2017-05-09 | Samsung Electronics Co., Ltd. | Method and apparatus for estimating state of charge (SOC) of battery in electric vehicle |
| JP5956624B1 (ja) | 2015-02-02 | 2016-07-27 | 西日本高速道路エンジニアリング四国株式会社 | 異常音の検出方法及びその検出値を用いた構造物の異常判定方法、並びに、振動波の類似度検出方法及びその検出値を用いた音声認識方法 |
| US10720240B2 (en) * | 2015-02-26 | 2020-07-21 | Koninklijke Philips N.V. | Context detection for medical monitoring |
| US10482184B2 (en) | 2015-03-08 | 2019-11-19 | Google Llc | Context-based natural language processing |
| JP6556575B2 (ja) | 2015-09-15 | 2019-08-07 | 株式会社東芝 | 音声処理装置、音声処理方法及び音声処理プログラム |
| US9847000B2 (en) | 2015-10-29 | 2017-12-19 | Immersion Corporation | Ambient triggered notifications for rendering haptic effects |
| US9946862B2 (en) | 2015-12-01 | 2018-04-17 | Qualcomm Incorporated | Electronic device generating notification based on context data in response to speech phrase from user |
| US10026401B1 (en) | 2015-12-28 | 2018-07-17 | Amazon Technologies, Inc. | Naming devices via voice commands |
| US10902043B2 (en) * | 2016-01-03 | 2021-01-26 | Gracenote, Inc. | Responding to remote media classification queries using classifier models and context parameters |
| US10373612B2 (en) | 2016-03-21 | 2019-08-06 | Amazon Technologies, Inc. | Anchored speech detection and speech recognition |
| US10304444B2 (en) | 2016-03-23 | 2019-05-28 | Amazon Technologies, Inc. | Fine-grained natural language understanding |
| WO2017187712A1 (ja) | 2016-04-26 | 2017-11-02 | 株式会社ソニー・インタラクティブエンタテインメント | 情報処理装置 |
| US10026405B2 (en) | 2016-05-03 | 2018-07-17 | SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S. | Method for speaker diarization |
| JP7006592B2 (ja) * | 2016-06-16 | 2022-01-24 | 日本電気株式会社 | 信号処理装置、信号処理方法および信号処理プログラム |
| CN109997186B (zh) * | 2016-09-09 | 2021-10-15 | 华为技术有限公司 | 一种用于分类声环境的设备和方法 |
| US10705683B2 (en) | 2016-10-31 | 2020-07-07 | Microsoft Technology Licensing, Llc | Changing visual aspects of a graphical user interface to bring focus to a message |
| EP3545374B1 (en) | 2016-11-23 | 2024-11-06 | Alarm.com Incorporated | Detection of authorized user presence and handling of unauthenticated monitoring system commands |
| US10713703B2 (en) | 2016-11-30 | 2020-07-14 | Apple Inc. | Diversity in media item recommendations |
| DK179496B1 (en) * | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
| EP3424046B1 (en) * | 2017-05-12 | 2020-07-08 | Apple Inc. | User-specific acoustic models |
| US10235128B2 (en) * | 2017-05-19 | 2019-03-19 | Intel Corporation | Contextual sound filter |
| US10311454B2 (en) | 2017-06-22 | 2019-06-04 | NewVoiceMedia Ltd. | Customer interaction and experience system using emotional-semantic computing |
| WO2019070328A1 (en) * | 2017-10-04 | 2019-04-11 | Google Llc | METHODS AND SYSTEMS FOR AUTOMATICALLY EQUALIZING AUDIO OUTPUT BASED ON THE CHARACTERISTICS OF THE PART |
| US10360482B1 (en) * | 2017-12-04 | 2019-07-23 | Amazon Technologies, Inc. | Crowd-sourced artificial intelligence image processing services |
| US10481858B2 (en) | 2017-12-06 | 2019-11-19 | Harman International Industries, Incorporated | Generating personalized audio content based on mood |
| US10832009B2 (en) | 2018-01-02 | 2020-11-10 | International Business Machines Corporation | Extraction and summarization of decision elements from communications |
| CN116738284A (zh) | 2018-01-25 | 2023-09-12 | 意法半导体公司 | 通过感测瞬态事件和连续事件的智能装置的情境感知 |
| US11011162B2 (en) * | 2018-06-01 | 2021-05-18 | Soundhound, Inc. | Custom acoustic models |
| US11195534B1 (en) * | 2020-03-30 | 2021-12-07 | Amazon Technologies, Inc. | Permissioning for natural language processing systems |
-
2020
- 2020-11-24 US US17/102,748 patent/US12198057B2/en active Active
-
2021
- 2021-11-19 KR KR1020237016581A patent/KR20230110518A/ko active Pending
- 2021-11-19 CN CN202180077450.9A patent/CN116601703A/zh active Pending
- 2021-11-19 JP JP2023528472A patent/JP7840941B2/ja active Active
- 2021-11-19 EP EP21831198.3A patent/EP4252230A1/en active Pending
- 2021-11-19 WO PCT/US2021/072521 patent/WO2022115839A1/en not_active Ceased
- 2021-11-22 TW TW110143343A patent/TW202232362A/zh unknown
-
2024
- 2024-12-05 US US18/970,538 patent/US20250103888A1/en active Pending
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2025023469A1 (ko) * | 2023-07-26 | 2025-01-30 | 삼성전자주식회사 | 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법 |
Also Published As
| Publication number | Publication date |
|---|---|
| US12198057B2 (en) | 2025-01-14 |
| JP7840941B2 (ja) | 2026-04-06 |
| US20250103888A1 (en) | 2025-03-27 |
| JP2023550336A (ja) | 2023-12-01 |
| WO2022115839A1 (en) | 2022-06-02 |
| US20220164662A1 (en) | 2022-05-26 |
| TW202232362A (zh) | 2022-08-16 |
| CN116601703A (zh) | 2023-08-15 |
| EP4252230A1 (en) | 2023-10-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20250103888A1 (en) | Context-based model selection | |
| JP7757405B2 (ja) | 適応型サウンドイベント分類 | |
| KR102932652B1 (ko) | 추론 연산을 수행하는 전자 장치 | |
| US11991253B2 (en) | Intelligent layer to power cross platform, edge-cloud hybrid artificial intelligence services | |
| US10657963B2 (en) | Method and system for processing user command to provide and adjust operation of electronic device by analyzing presentation of user speech | |
| US10692492B2 (en) | Techniques for client-side speech domain detection using gyroscopic data and a system using the same | |
| US11995561B2 (en) | Universal client API for AI services | |
| EP4066243B1 (en) | Sound event detection learning | |
| US10573299B2 (en) | Digital assistant and associated methods for a transportation vehicle | |
| KR20230110512A (ko) | 사운드 이벤트 분류를 위한 전이 학습 | |
| JP6619488B2 (ja) | 人工知能機器における連続会話機能 | |
| CN105573128B (zh) | 用户装置及其驱动方法、提供服务的设备及其驱动方法 | |
| KR102795306B1 (ko) | 학습 처리 시스템, 로컬 파라미터 개수 결정 장치 및 방법 | |
| CN115515068B (zh) | 电子设备及其分布式实现方法和介质 | |
| CN115698949A (zh) | 用于ai服务的通用客户端api | |
| US11296942B1 (en) | Relative device placement configuration | |
| CN110400561A (zh) | 用于交通工具的方法和系统 | |
| WO2020207316A1 (zh) | 设备资源配置方法、装置、存储介质及电子设备 | |
| US12229343B2 (en) | System, device and method for real time gesture prediction |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
Patent event date: 20230516 Patent event code: PA01051R01D Comment text: International Patent Application |
|
| PG1501 | Laying open of application | ||
| A201 | Request for examination | ||
| PA0201 | Request for examination |
Patent event code: PA02012R01D Patent event date: 20241104 Comment text: Request for Examination of Application |